AndyTheFactory / newspaper4k

📰 Newspaper4k, a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.
MIT License
348 stars 31 forks

How to get the list of all websites that are available for scraping? #530

Open AndyTheFactory opened 8 months ago

AndyTheFactory commented 8 months ago

Issue by aleksandar-devedzic Sun Jul 18 16:28:56 2021 Originally opened as https://github.com/codelucas/newspaper/issues/903


Is there a way to get a list of websites that can be crawled properly with the newspaper lib? For example, newspaper.sources or something like that?

AndyTheFactory commented 8 months ago

Comment by tspier Sun Jul 18 23:15:56 2021


Maybe one of the two files here? https://github.com/codelucas/newspaper/tree/master/newspaper/resources/misc

hfiamelgringo commented 2 weeks ago

The same folder in the newspaper4k repo is here: https://github.com/AndyTheFactory/newspaper4k/tree/master/newspaper/resources/misc

@AndyTheFactory Should we mark this as resolved or add this to the documentation?
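
In case it helps anyone landing here: a minimal sketch of reading one of the files from the folder linked above, assuming it is a plain-text list with one URL per line (I have not verified the format of every file in that folder). The function names and the local filename `popular_sources.txt` are hypothetical, used only for illustration. Note that such a list would only be a set of known news sites, not a guarantee that each one is parsed correctly.

```python
from pathlib import Path


def parse_source_list(text: str) -> list[str]:
    """Return the non-empty, non-comment lines of a one-URL-per-line listing."""
    urls = []
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and anything that looks like a comment.
        if not line or line.startswith("#"):
            continue
        urls.append(line)
    return urls


def load_source_list(path: str) -> list[str]:
    """Read a locally downloaded copy of the list (path is an assumption)."""
    return parse_source_list(Path(path).read_text(encoding="utf-8"))


if __name__ == "__main__":
    # Hypothetical local copy of a file from resources/misc.
    local_copy = Path("popular_sources.txt")
    if local_copy.exists():
        for url in load_source_list(str(local_copy)):
            print(url)
```

Download the file from the repo first, then run the script from the same directory.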