zamadatix
27d ago
Common Crawl doesn't publish raw DNS data separately; you have to pull the information out of the aggregate database. The WARC-IP-Address header should record the IP address Common Crawl connected to for the site.
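For anyone who wants to try this, here is a rough sketch of pulling that header out of a WARC file with only the Python standard library. It assumes the usual WARC record layout (a `WARC/` version line, named header fields, a blank line, then `Content-Length` bytes of body). The function name `iter_warc_ips` and the file name in the usage comment are made up for illustration; a dedicated WARC library would likely be more robust on real crawl data.

```python
import gzip
from typing import BinaryIO, Iterator, Tuple


def iter_warc_ips(stream: BinaryIO) -> Iterator[Tuple[str, str]]:
    """Yield (WARC-Target-URI, WARC-IP-Address) for each response record."""
    while True:
        line = stream.readline()
        if not line:
            return  # end of stream
        if not line.startswith(b"WARC/"):
            continue  # skip the blank lines that separate records
        headers = {}
        while True:
            line = stream.readline()
            if line in (b"\r\n", b"\n", b""):
                break  # a blank line ends the header block
            # Split on the first colon only, so IPv6 addresses and URLs survive.
            name, _, value = line.decode("utf-8", "replace").partition(":")
            headers[name.strip().lower()] = value.strip()
        # Skip the record body so the next iteration starts at the next record.
        stream.read(int(headers.get("content-length", "0")))
        if headers.get("warc-type") == "response" and "warc-ip-address" in headers:
            yield headers.get("warc-target-uri", ""), headers["warc-ip-address"]


# Usage sketch (hypothetical local file name):
# with gzip.open("example.warc.gz", "rb") as f:
#     for uri, ip in iter_warc_ips(f):
#         print(uri, ip)
```

Only `response` records are reported here, on the assumption that those are the ones carrying the address the crawler connected to; drop that check to see every record that has the header.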
ccgreg
26d ago
Good timing, I'm about to release that dataset.