cohere-training-data-crawler User-Agent & Blocking Rules
Operated by Cohere. Verify the official user-agent string and safety status for this bot. Use the tools below to run a live crawlability test and get copy-paste robots.txt rules to manage its access. Crawler info last updated on .
What is cohere-training-data-crawler?
Explicitly named crawler by Cohere for gathering training datasets for their Large Language Models.
Test if cohere-training-data-crawler can access your URL
Enter URL to test if cohere-training-data-crawler is allowed or blocked.
Is cohere-training-data-crawler safe?
It is recommended to use caution with cohere-training-data-crawler.
Caution bots may have legitimate uses but can be resource-intensive or used for competitive intelligence.
For more information about our safety badges, visit our documentation.
User-Agent String used by cohere-training-data-crawler
cohere-training-data-crawlerHow to block cohere-training-data-crawler with robots.txt?
Add the following standard robots.txt rule to prevent cohere-training-data-crawler from accessing your site.
User-agent: cohere-training-data-crawler Disallow: /
What happens if I block cohere-training-data-crawler?
Low Impact
Blocking cohere-training-data-crawler will prevent this AI bot from accessing your content. This will NOT affect your search rankings. Consider blocking if you're concerned about AI training or data usage.
Common use cases for cohere-training-data-crawler
Understanding how cohere-training-data-crawler is typically used can help you make informed decisions about whether to allow or block it on your website.
- Collects web content for AI model training
- Gathers data to improve language understanding
- Scrapes text, code, and other content for datasets
- May be used for chatbot and AI assistant development
- Typically does NOT affect search engine rankings