dankito / Readability4J

A Kotlin port of Mozilla‘s Readability. It extracts a website‘s relevant content and removes all clutter from it.
Apache License 2.0
145 stars 22 forks source link

don't show image for some url #14

Open allentown521 opened 3 years ago

allentown521 commented 3 years ago

https://www.natureworldnews.com/articles/45834/20210427/sumatran-rhinoceros-striving-genetic-diversity-despite-extinction.htm

There is a rhino picture at the beginning of the article, but after I parse the html using Readability4JExtended or Readability version 1.0.6, the img tag is gone and the picture is not displayed

allentown521 commented 2 years ago

https://tvline.com/2022/06/23/game-of-thrones-jon-snow-spinoff-hbo-kit-harington-george-rr-martin/

This is the same problem; so, is this library still maintained?