Open andisoer opened 3 months ago
I just tried something with Linkedin newsetters and it doesn't capture anything :/
There was a update from Google's side, the ID after /article/
used to be a base64 string representing the original website.
Since July, that changed and is not real anymore (community doesn't seems to know how to parse it btw)
Here's a reference: https://gist.github.com/huksley/bc3cb046157a99cd9d1517b32f91a99e
There's some community's member implementation of this code in Python -> https://github.com/SSujitX/google-news-url-decoder/blob/main/googlenewsdecoder/new_decoderv1.py
This requests Google for the URL though, so it may hit some 429's (which are very annoying). But i found no other solution but to do that before sending the URL to newspaper3k.
For the last few days, the parser using
only return Google RSS Images which is
and the title
instead of original articles images and titles, any issue on this parser or any update from Google RSS News?