JuliaParallel / ClusterManagers.jl

Better handling of SLURM job submission timing #175

Closed kescobo closed 2 years ago

kescobo commented 2 years ago

See related issues: #174 #173.

@David96 is planning to fix this based on an idea from @DrChainsaw

Fwiw, the LSF manager lets the user supply an iterator of retry delays. The default is an exponential backoff with a maximum number of attempts, but I tend to just use `Iterators.cycle(5)` (retry every 5 seconds, indefinitely) so I don't have to deal with timeouts, as the startup time at my place is extremely unpredictable.
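
A minimal sketch of the retry-delay idea, for illustration only. The helper `wait_until` and its `ready` argument are hypothetical and are not part of ClusterManagers.jl; the sketch assumes only that the delays come from some iterable of seconds, such as Base's `ExponentialBackOff` or `Iterators.cycle(5)`.

```julia
# Hypothetical helper (not ClusterManagers.jl API): call `ready()` until it
# returns true, sleeping for each delay taken from `retry_delays` in between.
# Returns false if the delays run out before `ready()` succeeds.
function wait_until(ready;
                    retry_delays = ExponentialBackOff(n = 10, first_delay = 1,
                                                      max_delay = 512, factor = 2))
    ready() && return true
    for delay in retry_delays
        sleep(delay)
        ready() && return true
    end
    return false
end

# Example condition: succeeds on the third check.
attempts = Ref(0)
function third_time_lucky()
    attempts[] += 1
    return attempts[] >= 3
end

# An endless iterator never gives up; a short delay keeps the demo fast.
succeeded = wait_until(third_time_lucky; retry_delays = Iterators.cycle(0.01))

# A finite iterator bounds the number of retries (here: 2 retries, 3 checks).
gave_up = !wait_until(() -> false; retry_delays = fill(0.01, 2))
```

With a finite iterator the caller still has to handle the `false` result. An endless one such as `Iterators.cycle(5)` trades that for a wait that only ends when the job starts or is cancelled.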