AndyTheFactory / newspaper4k

📰 Newspaper4k, a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.
MIT License

Can I get the modified date for URLs fetched through Newspaper3k? #538

Open AndyTheFactory opened 10 months ago

AndyTheFactory commented 10 months ago

Issue by purnima-kumari95 Fri Oct 8 12:18:02 2021 Originally opened as https://github.com/codelucas/newspaper/issues/914


None

AndyTheFactory commented 10 months ago

Comment by johnbumgarner Sun Nov 7 13:25:14 2021


Currently, Newspaper3k isn't programmed to extract the modified date from articles. It's doubtful that this feature will be added to Newspaper3k, because development is stagnant.
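In the meantime, one possible workaround is to read the modified date from the page's own metadata. The snippet below is only a minimal sketch using the Python standard library, not part of Newspaper3k. It assumes the site publishes the date in a `<meta>` tag such as `article:modified_time`, `og:updated_time`, `dateModified`, or `last-modified`; many sites do not, and the key list, `MODIFIED_KEYS`, and `extract_modified_date` are hypothetical names chosen for illustration.

```python
from html.parser import HTMLParser
from typing import Optional

# Assumed metadata keys, in order of preference. Real sites vary,
# so this list is illustrative and will likely need extending.
MODIFIED_KEYS = (
    "article:modified_time",
    "og:updated_time",
    "datemodified",
    "last-modified",
)


class _ModifiedDateParser(HTMLParser):
    """Collects the content of <meta> tags whose key looks like a modified date."""

    def __init__(self) -> None:
        super().__init__()
        self.found = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute values can be None (e.g. <meta content>), so normalise them.
        attr = {name: (value or "") for name, value in attrs}
        content = attr.get("content", "").strip()
        if not content:
            return
        # The key may live in property=, name=, or itemprop=.
        for key_attr in ("property", "name", "itemprop"):
            key = attr.get(key_attr, "").lower()
            if key in MODIFIED_KEYS and key not in self.found:
                self.found[key] = content


def extract_modified_date(html: str) -> Optional[str]:
    """Return the raw modified-date string from the HTML, or None if absent."""
    parser = _ModifiedDateParser()
    parser.feed(html)
    for key in MODIFIED_KEYS:
        if key in parser.found:
            return parser.found[key]
    return None
```

With Newspaper3k, the HTML can be taken from `article.html` after calling `article.download()`, then passed to `extract_modified_date(article.html)`. The function returns the string exactly as the site wrote it, so parsing it into a `datetime` is left to the caller.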

I'm currently working on a new news extractor named NewsHound, which does have this functionality. Please note that this Python package is still in alpha and hasn't been released publicly.

Here is a list of some of the site-specific sources that are being scraped at this time. Additional sources will be added before the package is released.

Are there any specific sources that you need?

AndyTheFactory commented 10 months ago

Comment by purnima-kumari95 Fri Nov 12 05:30:42 2021


When can I get the first release of the implemented library? @johnbumgarner