Ai2Bot-Dolma User-Agent & Blocking Rules

Operated by Allen Institute for AI. Verify the official user-agent string and safety status for this bot. Use the tools below to run a live crawlability test and get copy-paste robots.txt rules to manage its access. Crawler info last updated on .

AI Bots Caution

What is Ai2Bot-Dolma?

A specific AI2 crawler used to build the Dolma dataset, a massive open corpus for training language models.

Operator Information

Official Documentation

View Documentation

Crawler data update history

Added on

Last updated on

Test if Ai2Bot-Dolma can access your URL

Enter URL to test if Ai2Bot-Dolma is allowed or blocked.

https://
Ai2Bot-Dolma is  ...

Is Ai2Bot-Dolma safe?

It is recommended to use caution with Ai2Bot-Dolma.

Caution bots may have legitimate uses but can be resource-intensive or used for competitive intelligence.

For more information about our safety badges, visit our documentation.

User-Agent String used by Ai2Bot-Dolma

Ai2Bot-Dolma

How to block Ai2Bot-Dolma with robots.txt?

Add the following standard robots.txt rule to prevent Ai2Bot-Dolma from accessing your site.

User-agent: Ai2Bot-Dolma
Disallow: /

What happens if I block Ai2Bot-Dolma?

Low Impact

Blocking Ai2Bot-Dolma will prevent this AI bot from accessing your content. This will NOT affect your search rankings. Consider blocking if you're concerned about AI training or data usage.

Common use cases for Ai2Bot-Dolma

Understanding how Ai2Bot-Dolma is typically used can help you make informed decisions about whether to allow or block it on your website.

  • Collects web content for AI model training
  • Gathers data to improve language understanding
  • Scrapes text, code, and other content for datasets
  • May be used for chatbot and AI assistant development
  • Typically does NOT affect search engine rankings