machawk1 / warcreate

Chrome extension to "Create WARC files from any webpage"
https://warcreate.com
MIT License
206 stars 13 forks source link

Pages that report HTTP 304 Not Modified are not replayable #53

Closed machawk1 closed 10 years ago

machawk1 commented 10 years ago

Manually changing this to 200 OK and adjusting the Content-Length of the preceding WARC header fixes this issue but probably isn't the right approach. Given that WARCreate uses the page's HTML as the source for the content, this might be an option we want to allow the user to customize, as it might tie into implementing WARC revisit records.

machawk1 commented 10 years ago

Message Wayback returns on replaying content with 304 still intact:

Bad Content Exception

The content that was archived is not replayable.