there are some problems using the directrunner on jasmin (crashes on http errors).
We need a better way to run the scripts, probably a spark cluster would be most stable, using the beam to spark runner.
Investigate how we can do this, does jasmin have an existing spark cluster, is there one available to CEH somewhere (Iain might know/be able to point in a direction).
Another option is spinning up our own spark cluster on jasmin, investigate if possible/reasonable.
acceptance criteria
We know how to move forward with running the scripts on a spark cluster
there are some problems using the directrunner on jasmin (crashes on http errors). We need a better way to run the scripts, probably a spark cluster would be most stable, using the beam to spark runner.
Investigate how we can do this, does jasmin have an existing spark cluster, is there one available to CEH somewhere (Iain might know/be able to point in a direction).
Another option is spinning up our own spark cluster on jasmin, investigate if possible/reasonable.
acceptance criteria