hsiehsh168168 / warc-tools

Automatically exported from code.google.com/p/warc-tools
0 stars 0 forks source link

SRS 32 — The command line tool shall notify the user of any WARC-record's anomalies, missing required fields or incompatible fields types. #38

Open GoogleCodeExporter opened 8 years ago

GoogleCodeExporter commented 8 years ago
SRS 32 — The command line tool shall notify the user of any WARC-record's
anomalies, missing required fields or incompatible fields types.

Original issue reported on code.google.com by gordon.p...@gmail.com on 27 Jul 2008 at 10:06

GoogleCodeExporter commented 8 years ago
I tried testing this with:
for I in $(ls wdata/testwfile/*.warc); do ./warcvalidator -v -f $I; done

most files in the supplied test-data in wdata/testwfile/ failes with:
> debug: lib/private/wheader.c :2048:"Incompatible Warc Version"

Should the test-data not be updated to a warcversion that the warc-tools 
handles?

Original comment by bjarne.a...@gmail.com on 21 Oct 2008 at 5:27

GoogleCodeExporter commented 8 years ago
Tha validation of illegal header fields does not seem to work.

If I change this headerline:
WARC-Block-Digest:sha1:10451-9645-AM35P-12LL

to this one:
WARC-Block-Bjarne:sha1:10451-9645-AM35P-12LL

The warcvalidator still validates ? - is this OK ?

Original comment by bjarne.a...@gmail.com on 28 Oct 2008 at 7:20