cohere-training-data-crawler User-Agent & Blocking Rules

Operated by Cohere. Verify the official user-agent string and safety status for this bot. Use the tools below to run a live crawlability test and get copy-paste robots.txt rules to manage its access. Crawler info last updated on .

AI Bots Caution

What is cohere-training-data-crawler?

Explicitly named crawler by Cohere for gathering training datasets for their Large Language Models.

Operator Information

Operator

Cohere

Official Documentation

View Documentation

Crawler data update history

Added on

Last updated on

Test if cohere-training-data-crawler can access your URL

Enter URL to test if cohere-training-data-crawler is allowed or blocked.

https://
cohere-training-data-crawler is  ...

Is cohere-training-data-crawler safe?

It is recommended to use caution with cohere-training-data-crawler.

Caution bots may have legitimate uses but can be resource-intensive or used for competitive intelligence.

For more information about our safety badges, visit our documentation.

User-Agent String used by cohere-training-data-crawler

cohere-training-data-crawler

How to block cohere-training-data-crawler with robots.txt?

Add the following standard robots.txt rule to prevent cohere-training-data-crawler from accessing your site.

User-agent: cohere-training-data-crawler
Disallow: /

What happens if I block cohere-training-data-crawler?

Low Impact

Blocking cohere-training-data-crawler will prevent this AI bot from accessing your content. This will NOT affect your search rankings. Consider blocking if you're concerned about AI training or data usage.

Common use cases for cohere-training-data-crawler

Understanding how cohere-training-data-crawler is typically used can help you make informed decisions about whether to allow or block it on your website.

  • Collects web content for AI model training
  • Gathers data to improve language understanding
  • Scrapes text, code, and other content for datasets
  • May be used for chatbot and AI assistant development
  • Typically does NOT affect search engine rankings