Now keep in mind Illyes does not say these are all indexed and ranking – merely that 60% of the URLs Google knows about are duplicate. But this is still an incredible number – more than half of them are duplicate!
We see Google regularly kick duplicates out of the search results – completely if it is a case of hardcore spam – or at least to not show them until someone clicks on the link at the bottom of the search results to reveal duplicates. There are also duplicates that get removed via a DMCA filed with Google, to remove stolen content from the search results.
Google doesn’t tend to release crawl stats very often, so it is nice to hear that there are 140 trillion URLs that Google knows about, as well as the fact 60% of those known URLs are duplicate.
— Jennifer Slegg (@jenstar) November 16, 2015
Latest posts by Jennifer Slegg (see all)
- JSON-LD Google’s Preferred Structured Data Markup - October 21, 2016
- How Google Handles Site Moves When Old Domain Cannot Be Redirected - October 20, 2016
- Exit Page Interstitials Not Impacted by Google’s Mobile Interstitial Change - October 19, 2016
- Google: Leave 301 Redirects in Place at Least a Year, Preferably Longer - October 19, 2016
- Site Wide & Blogroll Links Are Not Automatically Bad Links for Google - October 19, 2016