Closed ChillarAnand closed 2 years ago
Hi, @ChillarAnand! Thanks! Would you mind to change/extend the description. Otherwise let me know and I'll update it. Also to clarify that there are two ways to access CC data including the CDX index.
Updated readme page as well as description. Free free to update further if needed.
Thanks, @ChillarAnand!
AWS has removed public access for S3 files. Updated read me & script for the same.
Refer: https://commoncrawl.org/2022/03/introducing-cloudfront-access-to-common-crawl-data/