XML Sitemap
An XML sitemap is an XML-formatted file that lists the important URLs of a website to help search engines like Google discover, crawl, and index them more efficiently. Each entry contains a page's address and may include optional metadata: the date of last modification (lastmod), update frequency, or relative priority. An XML sitemap does not guarantee indexing, but it signals to crawlers which pages are worth visiting, which proves decisive for large, new, or poorly linked sites. It complements the robots.txt file, where its location is usually declared, and can be submitted directly through Google Search Console. A sitemap can also reference images, videos, or alternate language versions (hreflang), and split into multiple files linked by an index when the site exceeds 50,000 URLs or 50 MB uncompressed.
The XML sitemap is one of the simplest and most cost-effective technical levers for improving how search engines discover your content. Yet it is often misconfigured or left neglected.
How it works
An XML sitemap is a structured file, usually named sitemap.xml, placed at the root of the domain. Each URL is wrapped in a <url> tag containing at minimum a <loc> (the page address). The optional <lastmod>, <changefreq>, and <priority> tags provide additional hints, but Google no longer gives meaningful weight to the last two. The lastmod date, however, remains valuable: it signals that a page has been updated and deserves a fresh crawl.
You declare your sitemap in the robots.txt file via a Sitemap: line and submit it in Google Search Console to monitor its processing.
Why it matters
On a small site, internal linking is often enough to surface everything. But as volume grows, the sitemap becomes a tool for steering crawl budget: it directs crawlers toward canonical, useful pages and avoids wasting effort on secondary URLs.
Best practices
Keep your sitemap updated automatically on every publication, free of 404 or 301 URLs, and segment it by content type (pages, articles, products) if your site is large. For multilingual sites, embed hreflang annotations directly in the sitemap to help Google serve the right version per market.
Questions fréquentes
No. Google can discover your pages through internal and external links. The sitemap remains strongly recommended for large, new, or poorly linked sites, as it speeds up the discovery of priority URLs.
A single sitemap file is limited to 50,000 URLs and 50 MB uncompressed. Beyond that, you must create multiple sitemaps linked by an index file, which acts as a table of contents pointing to each sub-sitemap.
Termes & ressources liés
Une question sur votre visibilité IA ?
Score de visibilité IA de votre site. Gap analysis vs 3 concurrents directs. 5 optimisations prioritaires. Livré en PDF, sans engagement.
Réponse sous 24h · Sans engagement · contact@luwiz.io