Closed Atisom closed 2 days ago
when I remove the '--collect.fullslurm' flag, it works again. Maybe it cannot measure some kind of job?
There is no --collect.fullslurm
flag on this exporter. It looks like you may be running a version of this exporter that is a fork with extra functionality.
$ cgroup_exporter --help 2>&1 | grep collect
--[no-]collect.proc Boolean that sets if to collect proc information
--collect.proc.max-exec=100
oh, sorry for that. The README.md file on the fork repo (https://github.com/plazonic/cgroup_exporter) confused me a bit :)
Dear Team,
We use the cgroup_exporter in more compute node, but sometime we got this error message:
I tried to restart the cgroup_exporter and the slurmd services, but it didn't solve the problem. After I rebooted the whole compute node, the issue resolved. Do you have any idea?