machawk1 / warcreate

Chrome extension to "Create WARC files from any webpage"
https://warcreate.com
MIT License

WARC file names should follow the format recommended in Annex C #138

Open machawk1 opened 1 year ago

machawk1 commented 1 year ago

See https://iipc.github.io/warc-specifications/specifications/warc-format/warc-1.1/#annex-c-informative-warc-file-size-and-name-recommendations

The format Prefix-Timestamp-Serial-Crawlhost.warc.gz is recommended.

WARCreate currently uses a 17-digit date stamp as the basis of the generated file name, e.g., 20230330135014298.warc.
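As a rough illustration of what the change could look like, here is a minimal sketch that assembles a name in the Prefix-Timestamp-Serial-Crawlhost shape. None of this is WARCreate's actual code: `warcFileName`, `timestamp14`, and `pad` are hypothetical helpers, and the 14-digit UTC timestamp, the 5-digit zero-padded serial, and the `WARCreate` / `localhost` values are assumptions chosen for the example.

```typescript
// Hypothetical helpers, not taken from the WARCreate code base.

// Left-pad a non-negative integer with zeros to a fixed width.
function pad(value: number, width: number): string {
  let s = String(value);
  while (s.length < width) {
    s = "0" + s;
  }
  return s;
}

// Assumption: a 14-digit UTC timestamp (YYYYMMDDhhmmss), i.e. the current
// 17-digit stamp with the milliseconds dropped.
function timestamp14(date: Date): string {
  return (
    pad(date.getUTCFullYear(), 4) +
    pad(date.getUTCMonth() + 1, 2) + // getUTCMonth() is zero-based
    pad(date.getUTCDate(), 2) +
    pad(date.getUTCHours(), 2) +
    pad(date.getUTCMinutes(), 2) +
    pad(date.getUTCSeconds(), 2)
  );
}

// Join the four parts with hyphens and append the extension.
// Assumption: the serial is zero-padded to 5 digits.
function warcFileName(
  prefix: string,
  date: Date,
  serial: number,
  crawlhost: string,
  gzip: boolean = true
): string {
  const base = [prefix, timestamp14(date), pad(serial, 5), crawlhost].join("-");
  return base + (gzip ? ".warc.gz" : ".warc");
}

// The instant from the example above, interpreted here as UTC.
const example = warcFileName(
  "WARCreate",
  new Date(Date.UTC(2023, 2, 30, 13, 50, 14, 298)),
  0,
  "localhost"
);
// example === "WARCreate-20230330135014-00000-localhost.warc.gz"
```

The `gzip` flag is included because the current example name ends in `.warc` rather than `.warc.gz`; passing `false` keeps the uncompressed extension.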