brainsciencecenter / salt

saltstack repositories for managing the brainsciencecenter's machines
0 stars 0 forks source link

slurm-sync-gcp is too restrictive #11

Closed holderg closed 4 years ago

holderg commented 4 years ago

slurm-sync-gcp only looks at machines matching the -compute* pattern. All other compute nodes will get terminated as un-responsive.

The correct filter is the intersection of the running instances and the node list from slurm.conf.

holderg commented 4 years ago

Fixes #11

Needed to rework slurm-sync-gcp filter after queuying for running instances.

verified arbitrary compute nodes are not being killed after 30 seconds, but idle nodes are still being taken down.