Closed codekoriko closed 4 years ago
You're getting a DNS failure looking up "web.archive.org", not the first time, but the Nth time. And that's currently not retried. I'll have to add some code to distinguish the case of "misconfigured CDX hostname that will never work" from "I've successfully fetched from this host before so a DNS error should be retried".
Version 0.9.28, just released, retries dns failures for IA and CC's known hostnames with enthusiasm.
Please open another issue if this doesn't work for you.
It seems to happens only with ia as a source and not cc. It also quite seldom, i'd say once every 5000-8000 CaptureObject's content attribute access
my code that triggers:
here is the full traceback: