codelucas / newspaper

newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:
https://goo.gl/VX41yK
MIT License
14.06k stars 2.11k forks source link

Can i get modified date for the urls fetched through newspaper3k? #914

Open purnima-kumari95 opened 2 years ago

johnbumgarner commented 2 years ago

Currently, Newspaper3k isn't programmed to extract the modified date from articles. It's doubtful that this feature will be added to Newspaper3k, because development is stagnant.

I'm currently working on a new news extractor named NewsHound . It does have this functionality. Please note that this Python package is still in Alpha thus hasn't been released publicly.

Here is a list of some of the site specific sources that are being scraped at this time. Additionally sources will be added before the package is released.

Are there any specific sources that you need?

purnima-kumari95 commented 2 years ago

When can I get the first release of the implemented library? @johnbumgarner