NERC-CEH / dri_gridded_data

GNU General Public License v3.0
0 stars 0 forks source link

Investigate options for running on a spark cluster #27

Open dolegi opened 3 weeks ago

dolegi commented 3 weeks ago

there are some problems using the directrunner on jasmin (crashes on http errors). We need a better way to run the scripts, probably a spark cluster would be most stable, using the beam to spark runner.

Investigate how we can do this, does jasmin have an existing spark cluster, is there one available to CEH somewhere (Iain might know/be able to point in a direction).

Another option is spinning up our own spark cluster on jasmin, investigate if possible/reasonable.

acceptance criteria