On some occasions, we get WARC archives where the HTTP status code is 0 instead of 200.
When looking inside the WARC, we see that the record does indeed have an HTTP status line whose value is:
HTTP/1.1 0 OK
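For context, HTTP status codes are three-digit values, so 0 is not something a server can legitimately return. Here is a minimal Python sketch of the check this status line fails. The helper names `parse_status_code` and `is_valid_status` are hypothetical and only for illustration; they are not part of any crawler or WARC library.

```python
import re

# Status line shape for HTTP/1.x: "HTTP/<major>.<minor> <code> <reason>"
STATUS_LINE = re.compile(r"^HTTP/(\d)\.(\d) (\d+)(?: (.*))?$")


def parse_status_code(status_line):
    """Return the numeric status code, or None if this is not a status line."""
    match = STATUS_LINE.match(status_line.strip())
    return int(match.group(3)) if match else None


def is_valid_status(code):
    """Valid HTTP status codes are three-digit values in the range 100-599."""
    return code is not None and 100 <= code <= 599


# The status line found in the problematic WARC record
code = parse_status_code("HTTP/1.1 0 OK")
print(code, is_valid_status(code))
```

The same check could be run over every response record in a WARC to list the affected URLs.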
Sample command to quickly reproduce the problem (only 53 pages are fetched):
Details about the problematic WARC record:
Running curl on the same URL, https://journals.openedition.org/bibnum/pdf/889, does not return a 0 status code.
At first, I thought this was because it is a direct fetch, but that is not the case: it is treated as a real page. Maybe the problem is that it is a PDF and is therefore handled differently by the browser (usually downloaded instead of being displayed)?
Is this normal / expected behavior or a bug?