Less Is More: How To Find Duplicate and Low Value Archives with XPath & Custom Extraction


Posted April 9, 2023 by davids6981172

Less Is More: How To Find Duplicate and Low Value Archives with XPath & Custom Extraction
 
Less Is More: How To Find Duplicate and Low Value Archives with XPath & Custom Extraction
In this article, I outline a method to identify elements of duplicate content to improve the quality and focus of your listing pages. This is imperative for enhanced Google ranking, resulting in potential customers speedily finding what they want, and sales conversions. This process encompasses empty and low-quality taxonomies, such as product listing pages, blog categories, and tags. E-commerce businesses often have multiple products, categories, and sub-categories, many of which do not add value. It is preferable to have a smaller number of targeted pages which rank for the relevant query, thereby enabling potential customers to find your products, and trust you are a reliable merchant. Below is an example of a store where a category page contains only one product instead of a range of oral hygiene stock. Searching customers are provided a limited choice and are likely to search elsewhere. https://www.argos.co.uk/browse/health-and-beauty/dental-care/teeth-whitening/c:29234/ The same applies to blog websites with drastic outcomes. A publisher can create many articles a month with different areas of interest. However, low-value pages with limited content will struggle to rank for terms that potential audiences are searching for and therefore will not attract leads. Below is an example with just two posts. To make matters worse, the same articles are assigned numerous taxonomies resulting in potential duplication. https://contentmarketinginstitute.com/tertiary-category/branded-content/ As you can see, both the e-commerce listing page and archive page contain little content, and that doesn't help anyone. These examples show why you should review your site and ensure you don’t have something similar. Duplication is Easy If you have an online business or blog website, generating duplicate or low-value pages is easy; this is especially true if your website is not audited often. Therefore, when publishing a post or product on your site, I recommend this simple check-list below: How many categories did you assign to your new post/product? How many tags did you add to your post/product? Did you check the spelling of your tags? Did you keep them consistent, with singular/plural, hyphen if multiple words, etc.? Did you train your editors and e-commerce managers? Do you have a solid taxonomy strategy? Did you perform solid keyword research? Many of the website sites I have audited didn’t do this. That is why in this article, I delineate a method which will help you find archive or listing pages with little or no content, be it posts or products, and define what action is required to resolve this. Duplicates Can Be Automatically Generated Easily It is really easy to unconsciously generate duplicate and low-quality pages, especially with blogging platforms like WordPress, where assigning a category or a tag to a post is a one-click action. I commonly find auto-generated duplicates while auditing web websites. Left alone, they can be harmful. A blog post with multiple and similar tags, as well as categories, automatically generates an archive for each of them.
Visit : https://new.allinclusive.agency/search-engine-optimization-service
-- END ---
Share Facebook Twitter
Print Friendly and PDF DisclaimerReport Abuse
Contact Email [email protected]
Issued By [email protected]
Country United States
Categories Business
Last Updated April 9, 2023