kotartemiy / newscatcher

Programmatically collect normalized news from (almost) any website.
https://newscatcherapi.com/
MIT License
2.94k stars 284 forks source link

Article body text #5

Closed petulla closed 4 years ago

petulla commented 4 years ago

Hi

I'm wondering if you plan to provide article body text in the future. Looking at a handful of publishers, I wasn't able to find it in their news collections.

kotartemiy commented 4 years ago

Hey, you can achieve this with https://github.com/codelucas/newspaper

Just take the URL of each article and get the info with newspaper3k