Open ocaisa opened 2 months ago
If using a scalable cluster and the Terraform Cloud token expires, nodes become "unresponsive". With Slurm < 24, the state of cloud nodes is not visible unless you set
PrivateData=cloud
in your slurm.conf. As stated in https://support.schedmd.com/show_bug.cgi?id=2771 this is the exact opposite of what you expect when setting this and this has been fixed in Slurm 24.05 .
slurm.conf
Is it possible to add this setting for Slurm < 24 ?
If you don't see that nodes are unresponsive, it's not easy to figure out why the cluster is not scaling
If using a scalable cluster and the Terraform Cloud token expires, nodes become "unresponsive". With Slurm < 24, the state of cloud nodes is not visible unless you set
in your
slurm.conf
. As stated in https://support.schedmd.com/show_bug.cgi?id=2771 this is the exact opposite of what you expect when setting this and this has been fixed in Slurm 24.05 .Is it possible to add this setting for Slurm < 24 ?