inhumantsar / slurp

Slurps webpages and saves them as clean, uncluttered Markdown. Think Pocket, but better.
https://inhumantsar.github.io/slurp/
MIT License

Parsing issues #18

Closed: chrisgrieser closed this issue 2 months ago

chrisgrieser commented 2 months ago

Author, as well as type, are frequently not parsed, even on very common news sites. One example is The Verge: https://www.theverge.com/2017/12/2/16727166/apple-ios-11-2-features-release

inhumantsar commented 2 months ago

thanks for letting me know! just pushed out a release that fixes this.

chrisgrieser commented 2 months ago

Thanks! May I ask what exactly is determining the type? In the link above, I'd consider it to be an article, but slurp categorizes it as post.

inhumantsar commented 2 months ago

all of the metadata is self-reported by the website; slurp just collects it
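To illustrate what "self-reported" means here, below is a minimal sketch of how a tool could collect whatever a page declares about itself in its `<meta>` tags. This is not slurp's actual code: `readSelfReportedMetadata` and the sample HTML are hypothetical, and the `og:type` and `author` keys are assumed only as common examples of metadata a site might publish. The point is that the collector does not decide the type; it returns whatever the page states.

```typescript
// Hypothetical helper for illustration only; not taken from slurp.
// Collects key/value pairs from <meta> tags exactly as the page declares them.
function readSelfReportedMetadata(html: string): Record<string, string> {
  const metadata: Record<string, string> = {};

  // Read one attribute value from a single tag, accepting either quote style.
  const attr = (tag: string, name: string): string | undefined => {
    const pattern = new RegExp(`\\s${name}\\s*=\\s*(?:"([^"]*)"|'([^']*)')`, "i");
    const m = pattern.exec(tag);
    return m ? (m[1] ?? m[2]) : undefined;
  };

  for (const tag of html.match(/<meta\b[^>]*>/gi) ?? []) {
    // Open Graph tags use `property`; classic tags such as author use `name`.
    const key = attr(tag, "property") ?? attr(tag, "name");
    const value = attr(tag, "content");
    if (key !== undefined && value !== undefined) {
      metadata[key] = value; // stored verbatim, no reinterpretation
    }
  }
  return metadata;
}

// Made-up sample page, not the markup of any real site.
const sampleHtml = `
<head>
  <meta property="og:type" content="article">
  <meta name="author" content="Jane Doe">
</head>`;

const sample = readSelfReportedMetadata(sampleHtml);
console.log(sample);
```

Under this approach, if a site labels its page as a post rather than an article, that label is passed through unchanged. A regex scan is used here only to keep the sketch dependency-free; a real HTML parser would be more robust.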

chrisgrieser commented 2 months ago

I see. Continuing with a different feature request then, since this is solved