mediacloud / metadata-lib

How Media Cloud approaches extracting metadata from online news stories
Apache License 2.0
12 stars 5 forks source link

ignore ports & handle IP domains in `normalize_url` #77

Closed rahulbot closed 10 months ago

rahulbot commented 10 months ago

Tweaks to improve URL normalization for #72