In this week’s Google Webmaster Hangout, a question was raised about using rel canonical across an entire site, with each page using the rel canonical to point to itself, in order to prevent the possibility of duplicate content being indexed.
The question asked was “Is it still okay to put rel canonical on every single page pointing to itself, just in order to avoid duplicate parameters and things like that?”
John Mueller responded yes, and the person then clarified to ask if it matters if it is site wide across a million pages.
It doesn’t matter how many pages. You just need to make sure that it points to the clean URL version, that you’re not pointing to the parameter version accidentally, or that you’re not always pointing to the homepage accidentally, because those are the types of mistakes we try and catch.
You can do that across millions of pages and we’ll try to take that into account.
Sometimes URLs with URL parameters – such as parameters used for tracking or advertising – end up being indexed. And while duplicate content isn’t a penalty per say, there is the possibility it can cause issues, particularly if a URL parameter URL ends up being linked, which often happens when others link to a page they may have originally clicked through from a social media site or ad campaign. Other reasons we see this being used is for www versus non-www (although preferred domain in Google Search Console should definitely be used for this), pagination, depending on how your server handles upper and lower case characters within the URL, as well as for PageRank reasons.
That said, it isn’t necessary to use rel canonical across every page on a site. But if you make extensive use of parameters, it might make sense to do this site wide as a preventative measure.
Using rel canonical for duplicate content issues isn’t new, but it is nice to have the clarification that it wouldn’t cause issues even if deployed across millions of sites when each page is canonicalized to itself.
Google also released a blog post on best practices for using rel canonical a couple of years ago, detailing how to use to properly as well as some of the issues they see in the implementation.
Latest posts by Jennifer Slegg (see all)
- Google AdWords Truncating Headlines for Some Ads - July 22, 2016
- Google: How to Temporarily Remove Pages Varies by Length - July 22, 2016
- Google Flights Testing Thumbnail Ads in Search - July 20, 2016
- Google Voice Searches 30 Times More Likely to be Action Queries - July 19, 2016
- Google Looking for Sites to Review for Next Site Clinic - July 18, 2016