codelucas / newspaper

newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:
https://goo.gl/VX41yK
MIT License
14.06k stars 2.11k forks source link

Categories filters don't work as expected #740

Open varuncheq opened 4 years ago

varuncheq commented 4 years ago

Hi team,

I've been trying to extract articles for a particular category https://www.dailymail.co.uk/health/index.html, however it when I check the articles being fetched, I get everything, not just the ones under health

paper = newspaper.build('https://www.dailymail.co.uk/health/index.html')
for article in paper.articles:
     print(article.url)

Is there something I'm missing? Cheers

Neileruaa commented 3 years ago

Hey ! Did you find a solution to your problem ? I am also looking for a solution...