DocNow / diffengine

track changes to the news, where news is anything with an RSS feed
MIT License
177 stars 30 forks source link

Error message: WARNING - not tweeting without archive urls #58

Closed yesboyicecream closed 4 years ago

yesboyicecream commented 4 years ago

I tried to set up diffengine yesterday and after a few teething issues, including installation failing due to the "--process-dependency-links" error, it seemed to run fine when I rolled back to Pip 18.

However, it does not tweet.

I do have 3 directories in my diffs folder but I noticed that differences were not tweeted. Looking through the diffengine log, there are three error messages that presumably relate to this: "WARNING - not tweeting without archive urls"

Not sure whether other errors in the diffengine.log are related:

"ERROR - unable to get archive id from None" "ERROR - unexpected archive.org response for https://web.archive.org/save/https://www.blahblah.com"

Any ideas?

edsu commented 4 years ago

Sorry you ran into the pip problem--but I'm glad you figured it out.

Just for the sake of replicating the problem are you really trying to archive https://blahblah.com? If you prefer you can email me the URL you are trying to monitor to ehs@pobox.com

edsu commented 4 years ago

@yesboyicecream it looks like the SavePageNow API changed as the result of some small changes at the Internet Archive. Thank you very much for reporting this, as their changes mean that all diffengine instances are not currently working properly!

Could you please try upgrading to v0.2.5 which was just relased and see if your problem goes away?

yesboyicecream commented 4 years ago

@edsu It doesn't seem to have solved the issue for me.

There is also a different error message: 2020-03-16 19:17:00,074 - root - ERROR - unexpected archive.org response for https://web.archive.org/save/https://www.irishtimes.com/news/ireland/irish-news/coronavirus-airlines-set-to-cancel-up-to-80-of-flights-1.4204016: 'EntryVersion' object has no attribute 'save_url'

The feed is: https://www.irishtimes.com/cmlink/news-1.1319192

I've followed the web.archive link and just wondering is it possible my problem is that some of this content appears to be behind a paywall?

edsu commented 4 years ago

Ok let me investigate some more, and thanks for sharing the URL to test with!

yesboyicecream commented 4 years ago

I just noticed that this tweeted this morning. Not sure what the difference is?

edsu commented 4 years ago

Thanks for letting me know! I think that the SavePageNow service is in a state of flux at the moment, and sometimes it is failing to respond (Service Unavailable). Its still good to keep this open because diffengine should account for this, and also log when it happens better.

nahuelhds commented 4 years ago

Hi guys! I'm facing the "'EntryVersion' object has no attribute 'save_url'" message. Is something I should worry about at all? I'm just building a new bot for diffing an uruguayan newspaper

edsu commented 4 years ago

Sorry for the delay on this one. I just released v0.2.6 to PyPI which hopefully fixes this problem. Please reopen this ticket if you continue to have problems -- and thanks for taking the time to report it!