iipc / jwarc

Java library for reading and writing WARC files with a typed API
Apache License 2.0
48 stars 9 forks source link

IoException reading gzip extra, fixes #14 #15

Closed sebastian-nagel closed 4 years ago

sebastian-nagel commented 4 years ago

Open question: shall the compressed record length (given in the extra field "sl") be used? Probably not.

ato commented 4 years ago

Ah, whoops. Thanks.

shall the compressed record length (given in the extra field "sl") be used?

I wouldn't say no to a patch that adds support for it. It is a pretty handy extension.

ato commented 4 years ago

Released as 0.8.2. It should sync to Maven Central in a couple of hours.

sebastian-nagel commented 4 years ago

Thanks! The idea to use the "sl" field to skip over records is tracked in #16 (so, it does not get forgotten).