iipc / jwarc

Java library for reading and writing WARC files with a typed API
Apache License 2.0
46 stars 8 forks source link

GitHub 71 include revisits #75

Closed thomasegense closed 1 year ago

thomasegense commented 1 year ago
Add support for revisit lines in CDX-indexer

New option -r or --revisits-included to include revisit for the
CDX-tool.

The mime-type will be set to 'warc/revisit' to support PyWb.
(calendar entry as revisit)

The PR closes https://github.com/iipc/jwarc/issues/71
ato commented 1 year ago

Thanks. Released as 0.25.0