GEO Crawler Notes
Updated: 2026-04-21
This note records the crawler assumptions currently used in the ATIEMPPO site.
Why this exists
ATIEMPPO wants strong GEO / AEO visibility while keeping the site simple, public, and compatible with GitHub Pages.
The site therefore prioritizes:
- static HTML
- clean public URLs
- robots.txt at root
- llms.txt at root
- sitemap.xml
- structured data
Officially reviewed crawler controls
OpenAI
Official docs reviewed:
- https://developers.openai.com/api/docs/bots
- https://help.openai.com/es-419/articles/12627856-editores-y-desarrolladores-preguntas-frecuentes
Current interpretation:
OAI-SearchBotmatters for ChatGPT Search visibilityChatGPT-Usermatters for user-directed visitsGPTBotrelates to training / model-improvement crawling
Anthropic
Official docs reviewed:
- https://support.claude.com/en/articles/8896518-does-anthropic-crawl-data-from-the-web-and-how-can-site-owners-block-the-crawler
Current interpretation:
Claude-SearchBotmatters for search visibilityClaude-Usermatters for user-directed retrievalClaudeBotrelates to training-related crawling
Official docs reviewed:
- https://developers.google.com/crawling/docs/crawlers-fetchers/google-common-crawlers
Current interpretation:
Google-Extendedis the robots token for Gemini training / grounding preferences- it does not affect Google Search inclusion or ranking
xAI / Grok status
As of 2026-04-21, no first-party xAI crawler-control page was found with the same clarity as OpenAI, Anthropic, or Google for:
- a public open-web crawler token
- separate search vs training crawler controls
- explicit robots.txt guidance for third-party site owners
Because of that, ATIEMPPO does not currently add an explicit Grok / xAI
group to robots.txt.
This is intentional.
If xAI later publishes official crawler documentation, update:
robots.txt- this file
- any GEO checklist used by the project
Current ATIEMPPO decision
For now, the site is configured to maximize discovery and AI-search visibility.
Assumption in force:
- discovery and citation are the current priority
- no explicit training opt-out is enabled yet for OpenAI or Anthropic
If ATIEMPPO later wants a stricter policy, update these groups in robots.txt:
GPTBotClaudeBot