netarchivesuite / so-me

Social Media harvests
Apache License 2.0

Tweet-JSON packed in WARCs should be compressed #9

Closed: tokee closed this issue 2 years ago

tokee commented 4 years ago

Currently the scripts store the JSON blocks for the tweets uncompressed. They should be compressed instead, so that .warc.gz files are produced.
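For illustration only, here is a minimal Python sketch of what producing a .warc.gz could look like. It is not the project's actual code. It assumes each tweet is wrapped in a WARC `resource` record and that every record is gzipped as its own member and appended to the output, which is the usual convention for .warc.gz files. The function names and the `urn:tweet:` target URI are hypothetical.

```python
import gzip
import io
import json
import uuid
from datetime import datetime, timezone


def build_warc_record(payload: bytes, target_uri: str) -> bytes:
    """Build one uncompressed WARC 'resource' record around a JSON payload."""
    now = datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
    headers = [
        "WARC/1.0",
        "WARC-Type: resource",
        f"WARC-Record-ID: <urn:uuid:{uuid.uuid4()}>",
        f"WARC-Date: {now}",
        f"WARC-Target-URI: {target_uri}",
        "Content-Type: application/json",
        f"Content-Length: {len(payload)}",  # length of the payload in bytes
    ]
    head = "\r\n".join(headers).encode("utf-8") + b"\r\n\r\n"
    # A WARC record ends with two CRLFs after the payload.
    return head + payload + b"\r\n\r\n"


def write_tweets(out, tweets) -> None:
    """Append one gzip member per tweet record to the binary stream `out`."""
    for tweet in tweets:
        payload = json.dumps(tweet).encode("utf-8")
        # Hypothetical URI scheme, used here only to have a target URI.
        record = build_warc_record(payload, f"urn:tweet:{tweet['id']}")
        # gzip.compress() emits a complete gzip member, so concatenating
        # the results keeps every record independently decompressible.
        out.write(gzip.compress(record))


if __name__ == "__main__":
    with open("tweets.warc.gz", "wb") as f:
        write_tweets(f, [{"id": 1, "text": "hello"}, {"id": 2, "text": "world"}])
```

Compressing per record rather than gzipping the whole file at once means a reader can seek to a record's offset and decompress just that record, while `zcat` or `gzip -d` still decode the file as a whole.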

tokee commented 2 years ago

Implemented on the logs branch long ago.