commoncrawl / cc-pyspark

Process Common Crawl data with Python and Spark
MIT License
406 stars 86 forks source link

Work-around to enable support of WARC/1.1 in warcio #3

Closed sebastian-nagel closed 2 years ago

sebastian-nagel commented 6 years ago

cf. webrecorder/warcio#37