A RetroSearch Logo

Home - News ( United States | United Kingdom | Italy | Germany ) - Football scores

Search Query:

Showing content from https://developers.google.com/search/docs/crawling-indexing/canonicalization below:

What is URL Canonicalization | Google Search Central | Documentation

Stay organized with collections Save and categorize content based on your preferences.

What is canonicalization

Canonicalization is the process of selecting the representative –canonical– URL of a piece of content. Consequently, a canonical URL is the URL of a page that Google chose as the most representative from a set of duplicate pages. Often called deduplication, this process helps Google show only one version of the otherwise duplicate content in its search results.

There are many reasons why a site may have duplicate content:

Some duplicate content on a site is normal and it's not a violation of Google's spam policies. However, having the same content accessible through many different URLs can be a bad user experience (for example, people might wonder which is the right page, and whether there's a difference between the two) and it may make it harder for you to track how your content performs in search results.

How Google indexes and chooses the canonical URL

When Google indexes a page, it determines the primary content (or centerpiece) of each page. If Google finds multiple pages that seem to be the same or the primary content very similar, it chooses the page that, based on the factors (or signals) the indexing process collected, is objectively the most complete and useful for search users, and marks it as canonical. The canonical page will be crawled most regularly; duplicates are crawled less frequently in order to reduce the crawling load on sites.

There are a handful of factors that play a role in canonicalization: whether the page is served over HTTP or HTTPS, redirects, presence of the URL in a sitemap, and rel="canonical" link annotations. You can indicate your preference to Google using these techniques, but Google may choose a different page as canonical than you do, for various reasons. That is, indicating a canonical preference is a hint, not a rule.

Different language versions of a single page are considered duplicates only if the primary content is in the same language (that is, if only the header, footer, and other non-critical text is translated, but the body remains the same, then the pages are considered to be duplicates). To learn more about setting up localized sites, see our documentation about managing multi-lingual and multi-regional sites.

Google uses the canonical page as the main source to evaluate content and quality. A Google Search result usually points to the canonical page, unless one of the duplicates is explicitly better suited for a search user. For example, the search result will probably point to the mobile page if the user is on a mobile device, even if the desktop page is the canonical.

Read more about how to indicate your preference for the canonical URL, and whether you need to.

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2025-03-06 UTC.

[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-03-06 UTC."],[[["Canonicalization is the process of choosing the best URL from a set of duplicate pages on a website."],["Google uses signals like HTTPS, sitemaps, and redirects to determine the canonical URL, aiming to show users the most relevant and complete version of a page."],["While website owners can suggest a preferred canonical URL, Google's algorithms may ultimately select a different URL based on various factors."],["Duplicate content arising from regional or device variations is common and not inherently problematic, but managing it can improve user experience and search performance."],["Google primarily uses the canonical version for content evaluation and search results, but may prioritize other versions (e.g., mobile) based on user context."]]],["Canonicalization is the process of selecting a representative URL for duplicate content. Google chooses the most complete and useful page as the canonical URL, indexing it more regularly. Duplicate pages may arise from region, device, protocol variants, site functions, or accidents. Factors like HTTP/HTTPS, redirects, sitemaps, and `rel=\"canonical\"` annotations influence Google's choice, though it can differ from site preferences. The canonical page is the primary source for content evaluation unless a duplicate better serves a user's specific context.\n"]]


RetroSearch is an open source project built by @garambo | Open a GitHub Issue

Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo

HTML: 3.2 | Encoding: UTF-8 | Version: 0.7.4