archive.org_bot User-Agent & Blocking Rules
Operated by Internet Archive. Verify the official user-agent string and safety status for this bot. Use the tools below to run a live crawlability test and get copy-paste robots.txt rules to manage its access. Crawler info last updated on .
What is archive.org_bot?
The crawler for the Internet Archive (Wayback Machine). It preserves the history of the web.
Test if archive.org_bot can access your URL
Enter URL to test if archive.org_bot is allowed or blocked.
Is archive.org_bot safe?
Yes, archive.org_bot is considered safe.
This bot is verified and generally respected by the SEO community.
For more information about our safety badges, visit our documentation.
User-Agent String used by archive.org_bot
archive.org_botHow to block archive.org_bot with robots.txt?
Add the following standard robots.txt rule to prevent archive.org_bot from accessing your site.
User-agent: archive.org_bot Disallow: /
What happens if I block archive.org_bot?
Unknown Impact
The impact of blocking archive.org_bot is not well documented. Review the crawler's official documentation and test in a staging environment before blocking in production.
Common use cases for archive.org_bot
Understanding how archive.org_bot is typically used can help you make informed decisions about whether to allow or block it on your website.
- Performs specialized web crawling tasks
- May be used for research or data collection
- Purpose varies depending on the specific bot
- Check official documentation for details
- Impact of blocking depends on your use case