aws-samples / aws-parallelcluster-monitoring

Monitoring Dashboard for AWS ParallelCluster
MIT No Attribution
31 stars 23 forks source link

vpenso/prometheus-slurm-exporter parsing errors #12

Closed bollig closed 9 months ago

bollig commented 2 years ago

https://github.com/aws-samples/aws-parallelcluster-monitoring/blob/e92abfce17a9a274558ee7deaeeccd09d86635d1/parallelcluster-setup/install-monitoring.sh#L86

This issue https://github.com/vpenso/prometheus-slurm-exporter/issues/55 impacts pcluster users. Quick fix is provided in this patch: https://github.com/MK4H/prometheus-slurm-exporter/commit/c48dc3d5b3d0ff5097c292957296161dadebbdb3

Until the fix is merged upstream, change

https://github.com/aws-samples/aws-parallelcluster-monitoring/blob/e92abfce17a9a274558ee7deaeeccd09d86635d1/parallelcluster-setup/install-monitoring.sh#L86

to

git clone https://github.com/vpenso/prometheus-slurm-exporter.git
sed -i 's/NodeList,AllocMem,Memory,CPUsState,StateLong/NodeList: ,AllocMem: ,Memory: ,CPUsState: ,StateLong:/' prometheus-slurm-exporter/node.go