Right now when processing a "historic" CSV file where HTML is to be found by legacy system downloads_id in an S3 bucket, if the object doesn't exist:
Nothing is logged
No counter is incremented
The story is retried (in an hour, or more)
Retries are repeated until the story is quarantined
This change tries to detect non-existent objects by having hist-fetcher examine the boto ClientError exception, logging and counting as "not-found" and discarding the story if the error was "404"
Right now when processing a "historic" CSV file where HTML is to be found by legacy system
downloads_id
in an S3 bucket, if the object doesn't exist:This change tries to detect non-existent objects by having hist-fetcher examine the boto
ClientError
exception, logging and counting as "not-found" and discarding the story if the error was "404"