codelucas / newspaper

newspaper3k is a news, full-text, and article metadata extraction library in Python 3. Advanced docs: https://goo.gl/VX41yK
MIT License
14.09k stars 2.11k forks

It turns out that a lot of sites do not work with #937

Open alekssamos opened 2 years ago

alekssamos commented 2 years ago

I am thoroughly disappointed. Why does the library rely on these dictionaries of key stop words? For many sites, it returns an empty string instead of the article text. I definitely didn't expect this.

johnbumgarner commented 2 years ago

What are some of the sites that aren't extracting for you?
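A list of failing URLs makes this much easier to look into. The sketch below is one possible way to collect it, not something from the library itself: it runs each URL through newspaper3k's `Article` download/parse flow and records those that come back with empty text. The helper names `extract_with_newspaper` and `find_empty_extractions` are made up for illustration, and the extractor is passed in as a parameter so the check can be tried without network access.

```python
from typing import Callable, Iterable, List


def extract_with_newspaper(url: str) -> str:
    """Download and parse one URL with newspaper3k and return the article text."""
    # Imported lazily so the checker below can be exercised without the package.
    from newspaper import Article

    article = Article(url)
    article.download()
    article.parse()
    return article.text


def find_empty_extractions(
    urls: Iterable[str],
    extract: Callable[[str], str] = extract_with_newspaper,
) -> List[str]:
    """Return the URLs whose extracted text is empty or whitespace only.

    Hypothetical helper: URLs that raise during download or parsing are
    reported as failures as well.
    """
    failing = []
    for url in urls:
        try:
            text = extract(url)
        except Exception as exc:  # download/parse errors count as failures
            print(f"{url}: error: {exc}")
            failing.append(url)
            continue
        if not text.strip():
            print(f"{url}: empty text")
            failing.append(url)
    return failing
```

Calling `find_empty_extractions([...])` with the article URLs you tried (none are assumed here) returns the ones worth posting in this thread.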

Baytars commented 2 years ago

Pubmed.
