CCBot (Common Crawl)

Common Crawl operates CCBot for the open web corpus. Official IPv4/IPv6 prefixes for CCBot are published in machine-readable JSON at index.commoncrawl.org; use that feed with the documented user-agent for allowlisting.

Autonomous systems

Network background

Egress addresses can change when Common Crawl updates infrastructure; prefer the published JSON and FAQ over static guesses.