iipc / warc-specifications

Centralised repository for WARC usage specifications.
http://iipc.github.io/warc-specifications/
100 stars 30 forks source link

CDX 2015: index.md fix typos #68

Closed VADemon closed 4 years ago

VADemon commented 4 years ago

I think I didn't do worse with any of the edits :) Feel free to edit/dismiss this update.

JustAnotherArchivist commented 4 years ago

I'm pretty sure that 'N massaged url' is not a typo. The massaged_url property in CDX-Writer removes and replaces problematic characters in a URL and transforms it into a canonical form. It fixes the URL's back pain with a massage, so to say.

The other changes seem fine.

ato commented 4 years ago

Thanks! I took in commit 48b27a0634bf1c4283203dcd06310b7a5439e5c4 all changes except s/massaged/messaged/ and the dots in front of file format names.