internetarchive / warctools

Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)
MIT License
152 stars 27 forks source link

warcvalid.py: Report exceptions #10

Closed pmyteh closed 10 years ago

pmyteh commented 10 years ago

warcvalid.py should report exceptions to stderr, as well as detected errors on reading records.

Previously, if a WARC file was broken and not reading effectively (perhaps due to gzip errors) then warcvalid would silently return -1 rather than printing errors. This gives a false sense of security to people using it from the command line.