ReproNim / reproman

ReproMan (AKA NICEMAN, AKA ReproNim TRD3)
https://reproman.readthedocs.io

run: add support for respecting limits on number of jobs #559

Open yarikoptic opened 3 years ago

yarikoptic commented 3 years ago

SLURM at TACC is configured to allow no more than 50 jobs per user in the queue, so when submitting beyond that limit you would see something like

          Welcome to the Lonestar 5 Supercomputer          
-----------------------------------------------------------------

No reservation for this job
--> Verifying valid submit host (login2)...OK
--> Verifying valid jobname...OK
--> Enforcing max jobs per user...OK
--> Verifying availability of your home dir (/home1/03372/yoh)...OK
--> Verifying availability of your work dir (/work/03372/yoh/lonestar)...OK
--> Verifying availability of your scratch dir (/scratch/03372/yoh)...OK
--> Verifying valid ssh keys...OK
--> Verifying access to desired queue (normal)...OK
--> Verifying job request is within current queue limits...FAILED

    [*] Too many simultaneous jobs in queue.
        --> Max job limits for normal =  50 jobs

It would be interesting to see if/how Launcher (#557) deals with that.
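A rough sketch of what respecting such a limit could look like on the submitting side: poll the number of queued jobs and hold back new submissions until there is room. This is not existing ReproMan code. `submit_with_limit`, `count_slurm_jobs`, and all their parameters are hypothetical names for illustration, and the sketch assumes that one line of `squeue` output corresponds to one job counted against the limit, which may not match how a given site counts (e.g. for job arrays).

```python
import getpass
import subprocess
import time


def count_slurm_jobs(user=None):
    """Count the user's jobs currently known to SLURM (pending or running).

    Assumption: each non-empty line printed by `squeue -h -u USER -o %i`
    is one job counted against the site's limit.
    """
    user = user or getpass.getuser()
    out = subprocess.run(
        ["squeue", "-h", "-u", user, "-o", "%i"],
        check=True, capture_output=True, text=True,
    ).stdout
    return len([line for line in out.splitlines() if line.strip()])


def submit_with_limit(jobs, submit, count_queued=count_slurm_jobs,
                      max_jobs=50, poll_interval=30.0, sleep=time.sleep):
    """Submit each job, waiting while the queue is at or above max_jobs.

    `submit` is any callable that submits one job and returns its result
    (e.g. a job ID); `count_queued` returns the current number of jobs.
    Both are injected so the throttling logic can be exercised without SLURM.
    """
    results = []
    for job in jobs:
        # Hold back until the queue has room for one more job.
        while count_queued() >= max_jobs:
            sleep(poll_interval)
        results.append(submit(job))
    return results
```

With `max_jobs=50` this would keep submissions under the limit reported above for the `normal` queue, at the cost of the submitting process having to stay alive until the last job is queued.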