# Maintainer note: hand-edited, served as-is by Cloudflare Pages. # The Sitemap: directive below must stay in sync with /sitemap.xml. # # IRWFA — International Rehabilitation Workforce Alliance # https://irwfa.org/ · members@irwfa.org # # Policy: this site is public and welcomes search engines AND AI / answer # engines to crawl, index, and cite it. Citation = visibility, and we want # the entity (IRWFA) understood and surfaced correctly. Nothing is disallowed. # # Content Signals (Cloudflare · ContentSignals.org) — what may be done AFTER fetching: # search = build a search index and show results # ai-input = use this content as input for AI answers (RAG / grounding) # ai-train = use this content to train or fine-tune AI models # IRWFA permits all three: our public knowledge is a commons, to be surfaced and cited. User-agent: * Content-Signal: search=yes, ai-input=yes, ai-train=yes Allow: / # --- Search engines (explicit, though covered by * above) --- User-agent: Googlebot Allow: / User-agent: Bingbot Allow: / User-agent: DuckDuckBot Allow: / User-agent: Applebot Allow: / # --- OpenAI --- User-agent: GPTBot Allow: / User-agent: OAI-SearchBot Allow: / User-agent: ChatGPT-User Allow: / # --- Anthropic (Claude) --- User-agent: ClaudeBot Allow: / User-agent: Claude-Web Allow: / User-agent: Claude-User Allow: / User-agent: Claude-SearchBot Allow: / User-agent: anthropic-ai Allow: / # --- Google AI (Gemini / Vertex training & grounding) --- User-agent: Google-Extended Allow: / User-agent: GoogleOther Allow: / # --- Apple Intelligence --- User-agent: Applebot-Extended Allow: / # --- Perplexity --- User-agent: PerplexityBot Allow: / User-agent: Perplexity-User Allow: / # --- Microsoft / Copilot --- User-agent: Bingbot Allow: / # --- Amazon --- User-agent: Amazonbot Allow: / # --- Meta AI --- User-agent: Meta-ExternalAgent Allow: / User-agent: meta-externalagent Allow: / User-agent: FacebookBot Allow: / # --- ByteDance / TikTok --- User-agent: Bytespider Allow: / # --- Cohere --- User-agent: cohere-ai Allow: / User-agent: cohere-training-data-crawler Allow: / # --- Mistral --- User-agent: MistralAI-User Allow: / # --- xAI (Grok) --- User-agent: xAI-Bot Allow: / # --- DuckDuckGo AI (DuckAssist) --- User-agent: DuckAssistBot Allow: / # --- Allen Institute for AI (OLMo) --- User-agent: AI2Bot Allow: / # --- Others (answer engines, indexes, training crawlers) --- User-agent: YouBot Allow: / User-agent: Diffbot Allow: / User-agent: PetalBot Allow: / User-agent: Timpibot Allow: / User-agent: Omgilibot Allow: / User-agent: ImagesiftBot Allow: / User-agent: CCBot Allow: / # --- Moonshot AI (Kimi) --- User-agent: Moonshot-AI Allow: / User-agent: Kimi Allow: / # --- International search engines (global reach) --- User-agent: YandexBot Allow: / User-agent: Baiduspider Allow: / User-agent: Yeti Allow: / User-agent: SeznamBot Allow: / User-agent: Sogou web spider Allow: / # --- Google verticals --- User-agent: Googlebot-Image Allow: / User-agent: Googlebot-News Allow: / Sitemap: https://irwfa.org/sitemap.xml Sitemap: https://irwfa.org/news-sitemap.xml Sitemap: https://irwfa.org/image-sitemap.xml