Sitemap

Definition
A sitemap is a file that lists the pages of a website to help search engines discover and understand its structure. It provides metadata about each URL including when it was last updated, how often it changes, and its relative importance. Sitemaps can be submitted directly to search engines to ensure important content is indexed.
There are two main types. XML sitemaps are created for search engines, while HTML sitemaps are designed for human users to navigate a site. For example, an e-commerce site with thousands of product pages can use an XML sitemap to make sure all items are visible to Google, while an HTML sitemap helps visitors quickly find categories.
Advanced
XML sitemaps can contain up to 50,000 URLs per file. Large websites often split their sitemaps into multiple files and use a sitemap index to manage them. Search engines use these files as discovery aids but still rely on crawling and internal linking to evaluate page importance.
Advanced usage includes dynamically generating sitemaps for frequently updated content such as news or product listings. Tags such as <priority>, <lastmod>, and <changefreq> can be added, although search engines treat them as suggestions rather than strict instructions. Submitting sitemaps in Google Search Console or Bing Webmaster Tools provides visibility into indexing status and errors.
Why it matters
Use cases
Metrics
Issues
Example
A news publisher creates a dynamic XML sitemap that updates every hour with newly published stories. After submitting this sitemap to Google News, the publisher ensures rapid indexing. Articles appear in search results within minutes, driving timely traffic to the site.