stfc / janus

collection of scripts to train and generate data for machine learnt interatomic potentials
BSD 3-Clause "New" or "Revised" License
0 stars 0 forks source link

Slurm errors for over 1000 jobs #30

Closed ElliottKasoar closed 1 year ago

ElliottKasoar commented 1 year ago

There is a maximum index that can be passed to SBATCH --array, which can lead to errors when writing and submitting scripts, such as active_learning_lammps.sh. I believe this maximum is admin controlled via MaxArraySize=1001.

We could be fix this by defining a new variable that adds 1000 (etc.) to the job array, although multiple scripts will be required to allow all jobs to be submitted.