commoncrawl / cc-index-table

Index Common Crawl archives in tabular format
Apache License 2.0
105 stars 9 forks source link

Improve extraction of host names and registered domains #26

Open sebastian-nagel opened 1 year ago

sebastian-nagel commented 1 year ago