awkward / Ocarina

A library to get metadata and Open Graph information from URLs
MIT License

Not pulling the data from certain websites #24

Closed by jaysonng 2 years ago

jaysonng commented 3 years ago

Hi,

I've been trying out Ocarina on different news sites, but on one website it doesn't pull og:description or the other details. As far as I can tell, only url and originalUrl are populated.

https://www.philstar.com/other-sections/news-commentary/2021/07/16/2112975/necropolitics-death-and-politics-usual

The page does have the headers:

[Screenshot: Screen Shot 2021-07-17 at 12 13 33 AM]

yet I always get nil.

The other sites I've tried have worked so far, pulling the data as expected.

Would you know why this is and if there's a fix?

thanks,

jaysonng commented 2 years ago

This is a UTF-8 encoding issue with the site and won't be fixed in Kanna.

See Kanna issue #257.