Diffbot User-Agent & Blocking Rules

Operated by Diffbot. Verify the official user-agent string and safety status for this bot. Use the tools below to run a live crawlability test and get copy-paste robots.txt rules to manage its access. Crawler info last updated on .

Scrapers Caution

What is Diffbot?

Diffbot uses computer vision and NLP to extract structured data from web pages.

Operator Information

Operator

Diffbot

Official Documentation

Not available

Crawler data update history

Added on

Last updated on

Test if Diffbot can access your URL

Enter URL to test if Diffbot is allowed or blocked.

https://
Diffbot is  ...

Is Diffbot safe?

It is recommended to use caution with Diffbot.

Caution bots may have legitimate uses but can be resource-intensive or used for competitive intelligence.

For more information about our safety badges, visit our documentation.

User-Agent String used by Diffbot

Diffbot

How to block Diffbot with robots.txt?

Add the following standard robots.txt rule to prevent Diffbot from accessing your site.

User-agent: Diffbot
Disallow: /

What happens if I block Diffbot?

Low Impact

Blocking Diffbot will prevent automated scraping of your content. This is generally safe and recommended to protect your content and reduce server load. Your search rankings will NOT be affected.

Common use cases for Diffbot

Understanding how Diffbot is typically used can help you make informed decisions about whether to allow or block it on your website.

  • Extracts content, prices, or data from websites
  • May be used for competitive intelligence
  • Often operates without explicit permission
  • Can increase server load and bandwidth usage
  • May violate terms of service or copyright