
SEO Configuration - Sitemap & Robots.txt

πŸ“ Files Location​

robots.txt

  • Location: static/robots.txt
  • Production URL: https://xerago.ai/robots.txt
  • Purpose: Controls search engine crawling behavior

sitemap.xml

  • Generated automatically by Docusaurus during build
  • Production URL: https://xerago.ai/sitemap.xml
  • Location after build: build/sitemap.xml

🤖 robots.txt Configuration

The robots.txt file is configured to:

  • ✅ Allow all search engines to crawl all pages
  • ✅ Reference the sitemap location
  • ✅ Include placeholders for blocking specific bots (if needed)
  • ✅ Include a crawl-delay option (commented out by default)

Current Configuration:

```
User-agent: *
Allow: /
Sitemap: https://xerago.ai/sitemap.xml
```

To Block Specific Pages:

Uncomment and modify these lines in static/robots.txt:

```
Disallow: /admin/
Disallow: /private/
```

To Block Specific Bots:

Add these lines to static/robots.txt, replacing BadBotName with the crawler's user-agent string:

```
User-agent: BadBotName
Disallow: /
```

πŸ—ΊοΈ Sitemap Configuration​

Configured in docusaurus.config.js:

```js
sitemap: {
  changefreq: 'weekly',         // How often pages change
  priority: 0.5,                // Default priority (0.0 to 1.0)
  ignorePatterns: ['/tags/**'], // Patterns to exclude
  filename: 'sitemap.xml',      // Output filename
}
```

Sitemap Settings Explained:

  1. changefreq: 'weekly'

    • Tells search engines how often to check for updates
    • Options: always, hourly, daily, weekly, monthly, yearly, never
  2. priority: 0.5

    • Default priority for all pages (0.0 = lowest, 1.0 = highest)
    • Homepage typically gets 1.0, other pages 0.5-0.8
  3. ignorePatterns

    • Excludes specific URL patterns from sitemap
    • Currently excludes tag pages: /tags/**
  4. filename: 'sitemap.xml'

    • Standard filename for sitemaps
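
For reference, these options are passed to the sitemap plugin through the preset configuration. The sketch below shows where the sitemap block typically sits, assuming the site uses @docusaurus/preset-classic; the title and other fields are illustrative placeholders, not the actual config:

```js
// docusaurus.config.js — abbreviated sketch (assumes @docusaurus/preset-classic;
// title, baseUrl, and unrelated options are placeholders or omitted).
module.exports = {
  title: 'Xerago',          // placeholder
  url: 'https://xerago.ai', // production URL used when generating sitemap entries
  baseUrl: '/',
  presets: [
    [
      'classic',
      {
        sitemap: {
          changefreq: 'weekly',
          priority: 0.5,
          ignorePatterns: ['/tags/**'],
          filename: 'sitemap.xml',
        },
      },
    ],
  ],
};
```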

πŸ” How to Verify​

After Building for Production:​

  1. Build the site:

    npm run build
  2. Check robots.txt:

    # File should exist at:
    build/robots.txt
  3. Check sitemap.xml:

    # File should exist at:
    build/sitemap.xml
  4. Serve locally to test:

    npm run serve

    Then visit http://localhost:3000/robots.txt and http://localhost:3000/sitemap.xml (port 3000 is the default for npm run serve).


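Optionally, these checks can be scripted. The sketch below is a hypothetical Node helper (e.g. scripts/check-seo-files.js, not part of the current setup) that confirms both files exist after a build and that robots.txt references the sitemap:

```js
// scripts/check-seo-files.js (hypothetical helper)
// Run with `node scripts/check-seo-files.js` after `npm run build`.
const fs = require('fs');
const path = require('path');

const buildDir = path.join(__dirname, '..', 'build'); // adjust if the script lives elsewhere
const robotsPath = path.join(buildDir, 'robots.txt');
const sitemapPath = path.join(buildDir, 'sitemap.xml');

let ok = true;

// Both files should be present at the root of the build output.
for (const file of [robotsPath, sitemapPath]) {
  if (fs.existsSync(file)) {
    console.log(`Found: ${file}`);
  } else {
    console.error(`Missing: ${file}`);
    ok = false;
  }
}

// robots.txt should contain a Sitemap: line pointing at the sitemap URL.
if (fs.existsSync(robotsPath)) {
  const robots = fs.readFileSync(robotsPath, 'utf8');
  if (!/^Sitemap:\s*\S+/m.test(robots)) {
    console.error('robots.txt does not contain a Sitemap: line');
    ok = false;
  }
}

process.exit(ok ? 0 : 1);
```
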
🚀 Production Deployment

After deploying to production, verify:

  1. robots.txt is accessible:

    • Visit: https://xerago.ai/robots.txt
    • Should display the robots.txt content
  2. sitemap.xml is accessible:

    • Visit: https://xerago.ai/sitemap.xml
    • Should display XML sitemap with all pages
  3. Submit to Search Engines:

    • Google Search Console: add and verify the xerago.ai property, then submit https://xerago.ai/sitemap.xml under the Sitemaps section
    • Bing Webmaster Tools: add and verify the site, then submit the same sitemap URL under Sitemaps
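
Optionally, the production URLs can be checked from the command line as well. The sketch below assumes Node 18+ for the built-in fetch; the filename is hypothetical:

```js
// scripts/check-seo-urls.js (hypothetical helper) — requires Node 18+ for global fetch.
// Run with `node scripts/check-seo-urls.js` after deployment.
const urls = [
  'https://xerago.ai/robots.txt',
  'https://xerago.ai/sitemap.xml',
];

async function main() {
  let ok = true;
  for (const url of urls) {
    const res = await fetch(url);        // both should return HTTP 200 in production
    console.log(`${url} -> HTTP ${res.status}`);
    if (!res.ok) ok = false;
  }
  process.exit(ok ? 0 : 1);
}

main().catch((err) => {
  console.error(err);
  process.exit(1);
});
```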


📊 Sitemap Contents

The sitemap will automatically include:

✅ All MDX pages in src/pages/

  • / (home)
  • /about-us
  • /blog/*
  • /customer-stories/*
  • /solutions/*
  • etc.

✅ All documentation pages in docs/

❌ Excluded patterns:

  • /tags/** (tag pages)
  • Any patterns added to ignorePatterns
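
To spot-check what actually ended up in the sitemap, you can list its URLs after a build. The sketch below is a hypothetical helper that uses a simple regex rather than a full XML parser:

```js
// scripts/list-sitemap-urls.js (hypothetical helper)
// Prints every <loc> entry in build/sitemap.xml and flags any /tags/ URLs,
// which should have been excluded by ignorePatterns.
const fs = require('fs');
const path = require('path');

const sitemapPath = path.join(__dirname, '..', 'build', 'sitemap.xml');
const xml = fs.readFileSync(sitemapPath, 'utf8');

const urls = [...xml.matchAll(/<loc>(.*?)<\/loc>/g)].map((m) => m[1]);

console.log(`${urls.length} URLs in sitemap:`);
for (const url of urls) {
  console.log(`  ${url}`);
}

const tagUrls = urls.filter((url) => url.includes('/tags/'));
if (tagUrls.length > 0) {
  console.warn(`Warning: ${tagUrls.length} tag URL(s) were not excluded`);
}
```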

πŸ› οΈ Customization Options​

Change Update Frequency for Specific Pages:

You can set a different change frequency and priority in page frontmatter:

```md
---
title: About Us
description: Learn about Xerago
# SEO customization
sitemap:
  changefreq: daily
  priority: 0.9
---
```

Exclude Specific Pages:

Add to docusaurus.config.js:

```js
sitemap: {
  ignorePatterns: [
    '/tags/**',
    '/admin/**',
    '/private/**',
  ],
}
```

✅ Checklist

  • robots.txt created in static/ folder
  • Sitemap configuration added to docusaurus.config.js
  • Sitemap references correct production URL
  • Test robots.txt after build (build/robots.txt)
  • Test sitemap.xml after build (build/sitemap.xml)
  • Submit sitemap to Google Search Console (after deployment)
  • Submit sitemap to Bing Webmaster Tools (after deployment)
  • Monitor crawl errors in search console

πŸ“ Notes​

  • Docusaurus automatically generates sitemap.xml during the build process
  • The sitemap is regenerated on every build with updated content
  • robots.txt is a static file and won't change unless you edit it
  • Both files will be available at the root of your production site