Closed djhmateer closed 10 months ago
Thank you for opening the issue, I've recently stumbled upon the same problem.
I'll try to look into it soon since it effectively blocks wacz collections when using a docker deployment.
After investigating the bug was introduced in this commit: https://github.com/bellingcat/auto-archiver/commit/987bbcaad083310791dda98687c24b9748089cfe
What happened? we removed pywb dependency from Pipfile thinking it was not being used but it is required so that browsertrix-crawler can work, it is installed in their own Dockerfile but since we use pipenv instead of the default pip installation it was not being accessed, and hence needs to be added explicitly to the Pipfile.
Thank you @msramalho it is working for me now!
Getting a proxy connection failed on the
wacz_archiver_enricher
on all urls.First time I've set this up, so probably something simple / maybe I've missed something.
Next step for me is to setup a local dev version and debug it.. but this issue may be useful for others at the same stage as me.
I have the profile setup in
secrets/profile.tar.gz
which I did viaOutput of the run is:
and
orchestation.yaml
is: