medialab / ural

A helper library full of URL-related heuristics.
GNU General Public License v3.0
63 stars 12 forks source link

urls_from_html enhancements #172

Closed Yomguithereal closed 1 year ago

Yomguithereal commented 1 year ago
Yomguithereal commented 1 year ago

Check urls starting with // and also maybe rewrite the regex to be less greedy.

Should also check that relative url with .. . works