ageitgey / node-unfluff

Automatically extract body content (and other cool stuff) from an html document
Apache License 2.0

Bad lazy author extraction #94

Open 8enmann opened 5 years ago

8enmann commented 5 years ago

https://medium.com/@pimterry/host-your-node-app-on-dokku-digitalocean-1cb97e3ab041

For this page, lazy author extraction pulls out <link rel="author" href="https://medium.com/@pimterry"> instead of <meta property="article:author" content="Tim Perry">.
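To make the expected behavior concrete, here is a rough sketch of the preference order I would expect: a meta tag carrying an author name first, the rel="author" link only as a fallback. This is not unfluff's actual code. `parseAttributes` and `extractAuthor` are hypothetical helpers written for illustration, the regex-based tag scan is a stand-in for a real HTML parser, and skipping URL-valued meta content is my own assumption about what counts as a name.

```javascript
// Hypothetical helper (not part of node-unfluff): collect the quoted
// attributes of a single tag into a plain object with lowercase keys.
function parseAttributes(tag) {
  const attrs = {};
  const attrPattern = /([\w:-]+)\s*=\s*(?:"([^"]*)"|'([^']*)')/g;
  let match;
  while ((match = attrPattern.exec(tag)) !== null) {
    attrs[match[1].toLowerCase()] = match[2] !== undefined ? match[2] : match[3];
  }
  return attrs;
}

// Hypothetical helper: prefer an author name from a <meta> tag and fall
// back to the href of <link rel="author"> only when no name is found.
function extractAuthor(html) {
  // Naive scan for <meta> and <link> tags; a real parser would be more robust.
  const tags = html.match(/<(?:meta|link)\b[^>]*>/gi) || [];
  const parsed = tags.map(parseAttributes);

  // Assumption: content that looks like a URL is a profile link, not a name.
  const looksLikeUrl = (value) => /^https?:\/\//i.test(value);

  const metaAuthor = parsed.find(
    (a) =>
      (a.property === 'article:author' || a.name === 'author') &&
      a.content &&
      !looksLikeUrl(a.content)
  );
  if (metaAuthor) {
    return metaAuthor.content.trim();
  }

  const linkAuthor = parsed.find((a) => a.rel === 'author' && a.href);
  return linkAuthor ? linkAuthor.href : null;
}

// The two tags from the page above, in the order reported.
const sampleHead =
  '<link rel="author" href="https://medium.com/@pimterry">' +
  '<meta property="article:author" content="Tim Perry">';

console.log(extractAuthor(sampleHead)); // prints "Tim Perry"
```

With this ordering the link is still used when it is the only author hint on the page, so pages without an author meta tag would keep returning the profile URL.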

8enmann commented 5 years ago

This also happens in non-lazy mode.