karussell / snacktory

Readability clone in Java
461 stars 159 forks source link

Text content is removed when there is an image in news webpage. #34

Open avi20072008 opened 11 years ago

avi20072008 commented 11 years ago

Hi,

I have tried using snacktory and It works well on the webpages which do not contain images. I have tried using one of the newspapers and I found that whenever there is an image, snacktory removes text block close to the image.

Try this url : http://articles.timesofindia.indiatimes.com/2013-09-17/rest-of-world/42147651_1_tropical-depression-mexico-city-heavy-rains

karussell commented 11 years ago

Would be nice if you could digg into it and provide a fix via pull request :) !