fhamborg / news-please

news-please - an integrated web crawler and information extractor for news that just works
Apache License 2.0
1.99k stars 414 forks

Remove newspaper3k from setup #273

Open Medno opened 2 days ago

Medno commented 2 days ago

Hey @fhamborg 👋

We've noticed that newspaper4k has been added to the setup. Since newspaper3k is also installed, it's unclear which library actually handles the articles, because both are imported the same way: `from newspaper import ...`

I also made some formatting changes to the setup.py file.

Let me know if you have any feedback 🙏

Medno commented 1 day ago

Hey @fhamborg, this is currently causing some crashes because both packages are installed into the same `newspaper/` folder. Imports such as `from newspaper.utils import whatever` conflict because `utils` is a file in one library and a folder in the other.