$ sacct
sacct: error: slurm_persist_conn_open_without_init: failed to open persistent connection to host:localhost:6819: Connection refused
sacct: error: Sending PersistInit msg: Connection refused
sacct: error: Problem talking to the database: Connection refused
Steps to Reproduce the Problem
Install az-hop with cc-slurm 3.x and slurm 23.x.
Solution
The problem is that /anfhome/slurm/config/accounting.conf is configured to point to localhost.
However, slurmdbd only runs on the scheduler node (sacct works fine there).
To fix, change localhost to {{ scheduler.name }} from the config file.
(There used to be logic for this in the slurm.conf.j2 template, but it seems this is no longer used with cc-slurm 3.x.)
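As a stopgap, the change described above could be applied by hand with something like the sketch below. It assumes the file contains a line of the form `AccountingStorageHost=localhost` (the key name is the standard Slurm parameter, but the exact contents of accounting.conf are not shown in this report), and `fix_accounting_host` is a hypothetical helper name. The scheduler hostname is passed in by the caller.

```shell
#!/bin/sh
# Sketch only: point the accounting storage host at the scheduler node
# instead of localhost.
# Assumption: the config file has a line "AccountingStorageHost=localhost".
fix_accounting_host() {
    conf_file="$1"        # e.g. /anfhome/slurm/config/accounting.conf
    scheduler_host="$2"   # hostname of the node where slurmdbd runs
    # Keep a .bak copy of the original, then rewrite only the matching line.
    sed -i.bak "s/^AccountingStorageHost=localhost\$/AccountingStorageHost=${scheduler_host}/" "$conf_file"
}

# Example call (the hostname "scheduler" is a placeholder):
# fix_accounting_host /anfhome/slurm/config/accounting.conf scheduler
```

After running it, re-running sacct on the ondemand node should show whether the connection to slurmdbd now succeeds.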
Version
1.0.40
In what area(s)?
Expected Behavior
sacct should work on the ondemand node.
Actual Behavior
sacct fails on the ondemand node with the "Connection refused" errors shown above.