Closed guillaumepitel closed 7 years ago
I know that I'm picky… sorry. I would prefer to have a PR for the bugfix (wrong https handling) that I'm ready to merge, and a spearate PR for the feature enanchement (MultiWarc), that will take me some time to review.
If you have time to do the split, I'll merge the PR. But since you have helped so much, if you don't have time I'll take the bugfix diff from your fork and commit it myself.
Let me know what you would prefer. Thanks again, Massimo.
Added new store class (MultiWarc), to store output of crawlers into a succession of files, instead of just one big always growing WARC