XML Sitemap
Also called Sitemap.xml, Sitemap file.
An XML sitemap is a file that lists a website's important URLs in a structured format, giving search engines a reliable list of pages to discover and crawl — particularly useful for large sites or pages that lack strong internal links.
What it means
An XML sitemap is a roadmap for search engine crawlers. It lists URLs in a structured XML format, with optional metadata about each URL including its last-modified date, change frequency, and relative priority. Submitted to Google via Search Console and referenced in robots.txt, it helps crawlers discover URLs they might otherwise find slowly or miss entirely.
Sitemaps matter most in two scenarios: newly-launched sites where internal linking hasn't built a strong crawl path yet, and large sites — ecommerce, news publishers, programmatic pages — where URL discovery through links alone would be slow or incomplete. For small sites with strong internal linking, the practical impact is modest.
Common sitemap mistakes include listing noindex pages, non-canonical URLs, soft 404s, and redirected URLs. The sitemap should include only canonical, indexable, 200-status pages — anything else sends conflicting signals to Google. Dynamic sitemaps that auto-update as content is added (as Next.js generates via sitemap.ts) are strongly preferred over static files that go stale.
Key takeaways
- Include only canonical, indexable, 200-status pages in the sitemap
- Submit sitemaps via Google Search Console for faster discovery and crawl monitoring
- Dynamic sitemaps that auto-update with new content are preferred over static files
- Priority and changefreq attributes are largely ignored by Google but can still be declared
- Sitemap.xml
- Sitemap file
Last updated . Spotted something wrong? Let us know.
Keep digging.
Technical SEO
Also: Technical optimizationTechnical SEO is the practice of optimizing a website's infrastructure — crawlability, indexation, site speed, structured data, and URL architecture — so search engines can efficiently discover, parse, and rank it.
Technical SEORead moreCrawl Budget
Crawl budget is the number of pages Googlebot will crawl on your site in a given timeframe, determined by your site's crawl capacity and Google's perceived demand to crawl it.
Technical SEORead moreIndexation
Also: Indexing, IndexabilityIndexation is the process by which a search engine adds a page to its searchable index after crawling it — and indexability is the property of being eligible for that process.
Technical SEORead moreRobots.txt
Also: Robots exclusion protocol, Robots exclusion standardA robots.txt file is a plain-text file at a domain's root that tells search engine crawlers which pages or directories they are and aren't allowed to crawl.
Technical SEORead more
Knowing what XML Sitemap is, is the easy part.
Implementing it on your site is what moves the needle. Get a free SEO audit and we’ll show you where xml sitemap fits in your roadmap.