iipc / warc-specifications

Centralised repository for WARC usage specifications.
http://iipc.github.io/warc-specifications/
100 stars 30 forks source link

Truncated responses are not anymore required to be marked with WARC-Truncated since 1.1 #62

Closed JustAnotherArchivist closed 2 years ago

JustAnotherArchivist commented 4 years ago

The WARC/1.0 specification contained the following sentence in the response record definition:

When a 'response' is known to have been truncated, this shall be noted using the WARC-Truncated field.

As far as I can see, the 1.1 spec does not have an equivalent statement. The definition of WARC-Truncated only mentions that the field may be used to indicate truncated responses. Revisit records with identical payloads are required to use the WARC-Truncated field, but this obviously doesn't apply to all other reasons for truncation, like timeouts or size limits.

ato commented 2 years ago

Since it's present in the BNF final drafts and the missing chunk of text aligns exactly to a page boundary I'm assuming it was unintentionally missed during the creation of the markdown file. I've added it back.