iipc / webarchive-commons

Common web archive utility code.
Apache License 2.0
50 stars 72 forks source link

ExtractingParseObserver: extract rel, hreflang and type attributes #86

Closed sebastian-nagel closed 4 years ago

sebastian-nagel commented 4 years ago

cf. commoncrawl/ia-web-commons#10