iipc / webarchive-commons

Common web archive utility code.
Apache License 2.0
50 stars 71 forks source link

Consider syncing up from the Common Crawl fork #94

Closed anjackson closed 2 years ago

anjackson commented 2 years ago

See https://github.com/iipc/webarchive-commons/compare/master...commoncrawl:ia-web-commons:master

There's a few things there that might be worth pulling in.

anjackson commented 2 years ago

Ah, looks like #84 and #87 should be dealt with first.