Open joonas-fi opened 4 years ago
Ah, oops. Not something I anticipated or encountered. How do you think we should handle this?
I dunno, this is a pickle. The obvious error is not being able to continue after 403. My data retrieval process just aborts.
But, what should we do about it? Sure, continue after the error. But, personally, I am not fan of losing any information. In this case the information is:
there once was an attachment, but we didn't manage to fetch it in time because it was later taken down because of a DMCA complaint
I'd prefer this to be stored in the data model. I haven't researched Timeliner's data model, something like attachment: {id: '987654321', permanentFetchFailureReason: '403 not found - Twitter or the author removed it?'}
?
Things to think about:
permanentFetchFailureReason
, should we still re-try fetching it? I guess chances are kinda slim that the likes of Twitter restore content. But then again, re-trying probably isn't expensive (if we re-try, maybe not pollute the log about errors we think are really likely to surface from re-try attempts)....if we're doing "full refresh" and we have a permanentFetchFailureReason, should we still re-try fetching it?
I concur with your conclusion that rechecking is inexpensive, and so worth trying.
403 is usually permanent, or something has to be changed on the server to remove that error.
Perhaps Timeliner should simply continue to the next item after seeing a 403. Log the 403, but continue on, since there's nothing we can do about it. This should probably be the behavior no matter what mode it's running in.
But I also agree that simply trying once or twice more before continuing on wouldn't be a bad idea, in case it was a fluke.
Here's the Tweet: https://twitter.com/janl/status/1113015555064201216
Error message:
That image URL redirects (when used with browser - different when API use?) to this DMCA warning.
Timeliner cannot cope with this, and trying to re-run Timeliner always gets me this and cannot continue.