C
CCBot (Common Crawl)
Common Crawl operates CCBot for the open web corpus. Official IPv4/IPv6 prefixes for CCBot are published in machine-readable JSON at index.commoncrawl.org; use that feed with the documented user-agent for allowlisting.
Autonomous systems
Network background
Egress addresses can change when Common Crawl updates infrastructure; prefer the published JSON and FAQ over static guesses.