How To Search And Delete Internal Duplicate Pages When Optimizing Sites.
Internal duplicate pages can seriously hurt a site's promotion in search engines, because they dilute the link weight and relevance of the pages being promoted. They also reduce the overall uniqueness of the site's content, because the same text appears on several pages at once. Finding and removing internal duplicates is therefore one of the main tasks of website optimization.
Internal Duplicate Pages.
Internal duplicates can be full (exact) or partial (fuzzy). They usually arise from the way the site's CMS generates pages. Duplicates can also result from the actions of an inexperienced webmaster who deliberately copies text across pages or creates identical pages.
To eliminate the negative impact of duplicate pages on promotion in search engines, try to find every duplicate on the site and “close” it to search engines, for example with the robots.txt file. There are several ways to find such pages.
You can search for duplicates manually by reviewing all the pages that search engines have indexed. To do this, enter the query “site:site_name.com” in the Yandex or Google search bar and look through each result. URLs that look atypical for the site are likely duplicates, and you should manually prohibit search engines from indexing them.
Another way to find internal duplicates is to use a dedicated program such as Xenu. It analyzes all the links on the site and shows which of them work and which do not. Xenu can also output a complete list of the site's page addresses as a table, which is convenient for spotting duplicates: addresses that differ by only a few characters are worth checking.
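Once a list of page addresses has been exported, the comparison can be partly automated. The sketch below is a minimal illustration and not a feature of Xenu: it reduces each URL to a comparison key and groups the addresses that share one. The normalization rules (ignoring "www.", a trailing slash, index.html or index.php, and the query string) are assumptions chosen for the example, and the function names are hypothetical. Every group it returns is only a candidate and still needs a manual check.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def normalize(url: str) -> str:
    """Reduce a URL to a comparison key (illustrative rules, adjust per site)."""
    parts = urlsplit(url.strip())
    host = parts.netloc.lower()
    if host.startswith("www."):
        host = host[4:]
    # Treat "/catalog/" and "/catalog" as the same address.
    path = parts.path.rstrip("/") or "/"
    # Treat "/catalog/index.php" as "/catalog" (assumed index file names).
    for index_name in ("/index.html", "/index.php"):
        if path.endswith(index_name):
            path = path[: -len(index_name)] or "/"
    # The query string is ignored on purpose, so sorted or printable
    # variants of a page land in the same group.
    return host + path


def candidate_duplicates(urls):
    """Return lists of URLs that share a normalized key."""
    groups = defaultdict(list)
    for url in urls:
        groups[normalize(url)].append(url)
    return [group for group in groups.values() if len(group) > 1]


if __name__ == "__main__":
    crawled = [
        "http://site_name.com/catalog/",
        "http://www.site_name.com/catalog",
        "http://site_name.com/catalog/index.php",
        "http://site_name.com/catalog?sort=price",
        "http://site_name.com/about",
    ]
    for group in candidate_duplicates(crawled):
        print(group)
```

Ignoring the query string is deliberately aggressive: on sites where a parameter selects genuinely different content, that rule should be relaxed.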
If the site has been added to Google Webmaster Tools, you can look for internal duplicates there. In the menu, open “Optimization” and then “HTML Improvements”. This section reports duplicate page titles and descriptions, which are the most likely signs of duplicate pages on a site. Review the pages with duplicate titles and descriptions and, where possible, remove the redundant ones.
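The same check for repeated titles and descriptions can be approximated offline. The sketch below is an assumption-laden illustration, not how Google computes its report: it takes a dictionary of page HTML that you have already downloaded, extracts the title and the meta description with the standard-library parser, and groups the pages where both match. The class and function names are hypothetical.

```python
from collections import defaultdict
from html.parser import HTMLParser


class HeadParser(HTMLParser):
    """Collect the <title> text and the meta description of one page."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.description = ""
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta" and (attrs.get("name") or "").lower() == "description":
            self.description = (attrs.get("content") or "").strip()

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def duplicate_heads(pages):
    """pages maps URL -> HTML; return groups of URLs with the same title and description."""
    groups = defaultdict(list)
    for url, html in pages.items():
        parser = HeadParser()
        parser.feed(html)
        # Collapse whitespace and case so trivial differences do not hide a match.
        title = " ".join(parser.title.split()).lower()
        groups[(title, parser.description.lower())].append(url)
    return [urls for urls in groups.values() if len(urls) > 1]
```

Matching on both fields keeps the groups small; matching on the title alone would catch more fuzzy duplicates at the cost of more false positives.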
How Can I Remove Internal Duplicates On A Site From A Search Engine Index?
The simplest way is to delete the duplicate pages from the site manually, where possible. On subsequent crawls, search engines will stop taking the deleted pages into account.
You can also remove duplicate pages from the index with a 301 redirect, the standard way to “glue” identical documents together. For example, a 301 redirect is used when you need to “glue” pages that are available both with and without www.
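To make the www case concrete, here is a minimal sketch of such a redirect written as Python WSGI middleware. It is an illustration only: redirects like this are often configured in the web server or the CMS instead, and the function name redirect_www is hypothetical. The sketch assumes the non-www address is the main one and ignores SCRIPT_NAME for brevity.

```python
def redirect_www(app):
    """Wrap a WSGI app so that requests to www.<host> get a 301 to <host>."""

    def middleware(environ, start_response):
        host = environ.get("HTTP_HOST", "")
        if host.lower().startswith("www."):
            scheme = environ.get("wsgi.url_scheme", "http")
            path = environ.get("PATH_INFO", "/")
            query = environ.get("QUERY_STRING", "")
            # Rebuild the same address without the "www." prefix.
            location = f"{scheme}://{host[4:]}{path}"
            if query:
                location += "?" + query
            start_response(
                "301 Moved Permanently",
                [("Location", location), ("Content-Length", "0")],
            )
            return [b""]
        # Requests to the main address pass through unchanged.
        return app(environ, start_response)

    return middleware
```

With this wrapper in place, a request for http://www.site_name.com/catalog?page=2 is answered with a 301 pointing to http://site_name.com/catalog?page=2, so both addresses resolve to a single document.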
Editing A robots.txt File.
Editing the robots.txt file is another way to keep duplicate pages out of the index. You can look for ready-made robots.txt settings for a specific content management system. In that case, everything that search engines do not need to index is already “closed” by Disallow directives, and all that remains is to add the customized robots.txt file to the site. If necessary, you can edit the file manually, but to do so you need to know how its directives work.
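Before uploading an edited file, it can help to confirm that the rules close the addresses you expect and nothing else. The sketch below does this with Python's standard urllib.robotparser module. The two Disallow paths (/print/ and /search) are hypothetical examples of sections that often produce duplicates, not settings for any particular CMS. The sketch sticks to plain path prefixes, since wildcard handling varies between parsers and crawlers.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: close printable copies and internal search results.
ROBOTS_TXT = """\
User-agent: *
Disallow: /print/
Disallow: /search
"""


def blocked_urls(robots_txt, urls, user_agent="*"):
    """Return the URLs that the given robots.txt text disallows for user_agent."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [url for url in urls if not parser.can_fetch(user_agent, url)]


if __name__ == "__main__":
    pages = [
        "http://site_name.com/print/article-1",
        "http://site_name.com/search/shoes",
        "http://site_name.com/article-1",
    ]
    for url in blocked_urls(ROBOTS_TXT, pages):
        print("closed:", url)
```

Running the check against the list of known duplicates and against a sample of pages that must stay open catches the most common mistake, a rule that is broader than intended.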
There are thus several ways to find and remove internal duplicate pages, and you can use any one of them or all of them in combination. Note, however, that removing duplicates carelessly can do more harm than the duplicates themselves, so this part of site optimization is best entrusted to professionals.
At the GCC MARKETING web design company, website optimization is carried out by experienced professionals. Our optimization services cover both internal work, such as eliminating technical errors in the site's code and optimizing meta tags, and external work: registering the site in directories and working with external links.
Order the optimization of your website from the GCC MARKETING web design company right now, so that your website can start working more efficiently today!