helgeho / Web2Warc

An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)
MIT License
24 stars 4 forks source link

Make the WARC-Record-ID conform to the specification #1

Closed thorkill closed 8 years ago

thorkill commented 8 years ago

WARC-Record-ID has to be valid URI as in RFC3986