archive.org_bot User-Agent & Blocking Rules

Operated by Internet Archive. Verify the official user-agent string and safety status for this bot. Use the tools below to run a live crawlability test and get copy-paste robots.txt rules to manage its access. Crawler info last updated on .

Other Agents Safe

What is archive.org_bot?

The crawler for the Internet Archive (Wayback Machine). It preserves the history of the web.

Operator Information

Official Documentation

Not available

Crawler data update history

Added on

Last updated on

Test if archive.org_bot can access your URL

Enter URL to test if archive.org_bot is allowed or blocked.

https://
archive.org_bot is  ...

Is archive.org_bot safe?

Yes, archive.org_bot is considered safe.

This bot is verified and generally respected by the SEO community.

For more information about our safety badges, visit our documentation.

User-Agent String used by archive.org_bot

archive.org_bot

How to block archive.org_bot with robots.txt?

Add the following standard robots.txt rule to prevent archive.org_bot from accessing your site.

User-agent: archive.org_bot
Disallow: /

What happens if I block archive.org_bot?

Unknown Impact

The impact of blocking archive.org_bot is not well documented. Review the crawler's official documentation and test in a staging environment before blocking in production.

Common use cases for archive.org_bot

Understanding how archive.org_bot is typically used can help you make informed decisions about whether to allow or block it on your website.

  • Performs specialized web crawling tasks
  • May be used for research or data collection
  • Purpose varies depending on the specific bot
  • Check official documentation for details
  • Impact of blocking depends on your use case