img2dataset User-Agent & Blocking Rules

Operated by the open-source community. This page lists the user-agent string and safety status for this bot, along with copy-paste robots.txt rules to manage its access.

Category: AI Bots | Safety badge: Aggressive

What is img2dataset?

A tool often used by researchers and developers to scrape images and create large-scale image-text datasets for AI training.

Operator Information

Official Documentation

Not available

Is img2dataset safe?

No, img2dataset is generally considered aggressive.

This bot typically consumes server resources without providing search ranking value.

For more information about our safety badges, visit our documentation.

User-Agent String used by img2dataset

img2dataset
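
One way to act on this string is a server-side check of the User-Agent request header. The following is a minimal sketch, assuming the client includes the token "img2dataset" somewhere in its User-Agent header. The function name is_img2dataset is hypothetical and used only for illustration.

```python
from typing import Optional


def is_img2dataset(user_agent: Optional[str]) -> bool:
    """Return True if the User-Agent header contains the img2dataset token.

    Assumption: the client identifies itself with the token "img2dataset".
    The match is case-insensitive and tolerates a longer browser-style string
    wrapped around the token.
    """
    if not user_agent:
        return False
    return "img2dataset" in user_agent.lower()
```

A web server or middleware could call this on each request and return a 403 response when it matches. Clients can change or omit their User-Agent header, so a check like this only catches requests that identify themselves.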

How to block img2dataset with robots.txt?

Add the following standard robots.txt rule to tell img2dataset not to access any part of your site. robots.txt is advisory, so the rule only takes effect when the client reads and honors it.

User-agent: img2dataset
Disallow: /
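
The rule above can be checked locally before you deploy it. The following is a minimal sketch using Python's standard urllib.robotparser module. The helper name can_fetch, the example.com URL, and the "SomeOtherBot" user agent are placeholders chosen for illustration.

```python
from urllib.robotparser import RobotFileParser

# The rule from this page, as it would appear in robots.txt.
ROBOTS_TXT = """\
User-agent: img2dataset
Disallow: /
"""


def can_fetch(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if robots_txt allows user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Placeholder URL; substitute a path from your own site.
blocked = can_fetch("img2dataset", "https://example.com/images/cat.jpg")
others = can_fetch("SomeOtherBot", "https://example.com/images/cat.jpg")
print(blocked, others)  # prints: False True
```

This only confirms how a compliant parser interprets the rule. It does not show whether a given client actually requests robots.txt.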

What happens if I block img2dataset?

Low Impact

Blocking img2dataset will prevent this AI bot from accessing your content. This will NOT affect your search rankings. Consider blocking if you're concerned about AI training or data usage.

Common use cases for img2dataset

Understanding how img2dataset is typically used can help you make informed decisions about whether to allow or block it on your website.

  • Collects images from the web for AI model training
  • Builds large-scale image-text datasets from image URLs and their captions
  • Downloads images in bulk rather than text or code
  • May supply training data for image-generation and multimodal AI models
  • Typically does NOT affect search engine rankings