Python 3.8 is incompatible with cpg-utils (new style type annoations) so this is updated to 3.10 by bumping Ubuntu to 22.04. cpg-utils is required to authenticating with github to clone private repos and this step was failing.
Adds fix to dataproc image which addresses pip depdendency resolution error in deploy.yaml when dataproc cluster is being initialised
Also updates documentation to explain these things.
Michael note: I'll follow-up and update the authentication of this repo in another PR.
Context: https://centrepopgen.slack.com/archives/C04M1G5HLM9/p1690947796216669
TLDR: Fixes a few issues in the dataproc image
Also updates documentation to explain these things.
Michael note: I'll follow-up and update the authentication of this repo in another PR.