lycheeverse / lychee

⚡ Fast, async, stream-based link checker written in Rust. Finds broken URLs and mail addresses inside Markdown, HTML, reStructuredText, websites and more!
https://lychee.cli.rs
Apache License 2.0
2.23k stars 136 forks source link

Ignore textContent links in html nodes #1527

Closed markogalevski closed 1 month ago

markogalevski commented 1 month ago

This fixes issue #1462 by remove plaintext uri parsing in html5ever and pruning attribute-less URIs in html5gum