cantino / ruby-readability

Port of arc90's readability project to Ruby
Apache License 2.0
925 stars, 171 forks

Twitter tweet content is not picked up #77

Closed: thom4parisot closed this issue 9 years ago

thom4parisot commented 9 years ago

Hello,

I tried readability on this page: https://twitter.com/jamescridland/status/555108097803694080 – which returned this HTML content:

<div><div> <p> Reply </p> <p> Retweet Retweeted </p> <p> Favourite 1 Favourited 1 </p> </div></div>

Tweets are tricky, as they might be too short to be picked up as the main content. Is the markup distinctive enough to single out the main content anyway?

Thanks :-)

thom4parisot commented 9 years ago

I guess this is not a relevant case: the tweet's content is already available in the page's OpenGraph attributes, the initial content is stored in a JavaScript object, and even on a rendered page the content is so short that it is nearly impossible to extract.
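
For anyone landing here with the same problem, below is a minimal sketch of the OpenGraph fallback mentioned above. It is not part of ruby-readability: `og_content` is a hypothetical helper, the sample HTML is made up for illustration, and the regular-expression matching only copes with simple, well-formed `<meta>` tags. A real HTML parser would be more robust.

```ruby
# Hypothetical helper (NOT part of ruby-readability): read an OpenGraph
# value such as og:description straight from the raw HTML.
# Assumption: <meta> tags are simple and their attribute values are quoted.
def og_content(html, property)
  html.scan(/<meta\b[^>]*>/i).each do |tag|
    # Collect the tag's attributes into a { name => value } hash.
    attrs = tag.scan(/([\w:-]+)\s*=\s*(?:"([^"]*)"|'([^']*)')/)
               .map { |name, dq, sq| [name.downcase, dq || sq] }
               .to_h
    next unless attrs["property"] == "og:#{property}"
    return attrs["content"] if attrs["content"]
  end
  nil # no matching tag found
end

# Made-up sample page, shaped like a short post with OpenGraph metadata.
sample = <<~HTML
  <html><head>
    <meta property="og:title" content="Example author on Twitter">
    <meta property="og:description" content="An example tweet body">
  </head><body><p>Reply</p><p>Retweet</p></body></html>
HTML

puts og_content(sample, "description")
```

The idea would be to try this first for very short pages and only fall back to the readability heuristics when no `og:description` is present. HTML entities in the attribute value are left encoded in this sketch.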