scontrol show config issues RPC calls to master node.
When Slurm is configured with per-user RPC rate-limiting (rl_enable), the command can be throttled causing a large variance in rank start times since it is called in a hook executed per rank.
This PR leverages env variables to get the info for the PMI hook instead of calling scontrol show config.
scontrol show config
issues RPC calls to master node. When Slurm is configured with per-user RPC rate-limiting (rl_enable
), the command can be throttled causing a large variance in rank start times since it is called in a hook executed per rank. This PR leverages env variables to get the info for the PMI hook instead of callingscontrol show config
.CC @flx42 @3XX0