Diffbot User-Agent & Blocking Rules
Operated by Diffbot. Verify the official user-agent string and safety status for this bot, and get copy-paste robots.txt rules to manage its access.
What is Diffbot?
Diffbot uses computer vision and NLP to extract structured data from web pages.
Is Diffbot safe?
It is recommended to use caution with Diffbot.
Bots rated "Caution" may have legitimate uses but can be resource-intensive or used for competitive intelligence.
For more information about our safety badges, visit our documentation.
User-Agent String used by Diffbot
Diffbot
How to block Diffbot with robots.txt?
Add the following standard robots.txt rule to prevent Diffbot from accessing your site.
User-agent: Diffbot
Disallow: /
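To sanity-check the rule before deploying it, a minimal sketch using Python's standard `urllib.robotparser` is shown here. The helper name `is_allowed` and the `example.com` URL are illustrative assumptions, not part of any Diffbot tooling, and the check only shows how a standards-following parser reads the rule, not how Diffbot itself behaves.

```python
from urllib.robotparser import RobotFileParser

# The rule from this page, written as it would appear in robots.txt.
ROBOTS_TXT = """\
User-agent: Diffbot
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if `user_agent` may fetch `url` under `robots_txt`.

    `is_allowed` is a hypothetical helper for illustration only.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # example.com is a placeholder host, assumed for this sketch.
    print("Diffbot:", is_allowed(ROBOTS_TXT, "Diffbot", "https://example.com/products"))
    print("Googlebot:", is_allowed(ROBOTS_TXT, "Googlebot", "https://example.com/products"))
```

The rule names only Diffbot, so other crawlers that have no matching group in the file remain allowed by default.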
What happens if I block Diffbot?
Low Impact
Blocking Diffbot asks the crawler to stop fetching your pages, which prevents automated scraping of your content as long as the bot honors robots.txt. This is generally safe and recommended to protect your content and reduce server load. Your search rankings will not be affected.
Common use cases for Diffbot
Understanding how Diffbot is typically used can help you make informed decisions about whether to allow or block it on your website.
- Extracts content, prices, or data from websites
- May be used for competitive intelligence
- Often operates without explicit permission
- Can increase server load and bandwidth usage
- May violate terms of service or copyright
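One way to weigh the server-load point above is to measure how much traffic the bot actually generates. The following is a rough sketch that counts requests and response bytes for log lines whose user-agent field contains the token "Diffbot". It assumes the Apache/Nginx "combined" access log format; the function name `summarize_bot_traffic` and the sample log lines are made up for illustration.

```python
import re

# Assumption: Apache/Nginx "combined" log format. Adjust for other formats.
LINE_RE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[[^\]]*\] "[^"]*" '
    r'(?P<status>\d{3}) (?P<size>\d+|-) "[^"]*" "(?P<agent>[^"]*)"'
)


def summarize_bot_traffic(lines, token="Diffbot"):
    """Return (request_count, total_bytes) for lines whose UA contains `token`."""
    requests = 0
    total_bytes = 0
    for line in lines:
        match = LINE_RE.match(line)
        if not match:
            continue  # skip lines that do not fit the assumed format
        if token.lower() not in match.group("agent").lower():
            continue  # request came from a different client
        requests += 1
        size = match.group("size")
        if size != "-":  # "-" means no response body was sent
            total_bytes += int(size)
    return requests, total_bytes


# Hypothetical sample lines, not real traffic.
SAMPLE_LOG = [
    '203.0.113.7 - - [10/Oct/2024:13:55:36 +0000] "GET /products HTTP/1.1" 200 5120 "-" "Diffbot"',
    '198.51.100.4 - - [10/Oct/2024:13:55:40 +0000] "GET / HTTP/1.1" 200 1024 "-" "Mozilla/5.0"',
    '203.0.113.7 - - [10/Oct/2024:13:56:01 +0000] "GET /pricing HTTP/1.1" 304 - "-" "Diffbot"',
]

if __name__ == "__main__":
    count, size = summarize_bot_traffic(SAMPLE_LOG)
    print(f"Diffbot requests: {count}, bytes served: {size}")
```

A user-agent header can be spoofed, so counts like these are an estimate rather than proof of who sent the requests.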