ia_archiver
ia_archiver is assigned to the Search category by BotScope. Detection is based on the user-agent string. Current pattern:
(ia_archiver|archive\.org_bot|IABot/|Archive-It)
The UA name ia_archiver is the historical crawler identifier of the Internet Archive (archive.org / Wayback Machine). This legacy name is rarely used today — the current UA string is Mozilla/5.0 (compatible; archive.org_bot; Wayback Machine Live Record; +http://archive.org/details/archive.org_bot).
BotScope recognizes both forms (ia_archiver, archive.org_bot, IABot/, Archive-It) and groups them under this bot classification.
If you don't want your site to be archived, you can exclude both UA names in your robots.txt — see directive below. Note that Archive-It (the commercial variant used by libraries and institutions) partly ignores robots.txt by design.
Search engine crawlers index publicly available content for their results. They typically fetch /robots.txt first and respect crawl-delay and Disallow rules. High crawl frequency usually correlates with good visibility in those search results.
In the BotScope dashboard you see for ia_archiver: top paths, status distribution, byte volume and whether the user-agent signature stays stable over time or rotates.
robots.txt directive for ia_archiver
If you don't want ia_archiver to crawl your site, add the following block to your /robots.txt. This works only for bots that honour robots.txt — malicious crawlers ignore it.
User-agent: ia_archiver Disallow: /
This page is being enriched with detail knowledge about ia_archiver.