When news nodes come into the cluster(s) we need to be able to add them to the monitoring stack- write an ansible role/playbook to do that.
The ansible code should:
1) install prometheus, node-exporter, dgcm (if its a compute node) and promtail
2) config each service to run as a systemd job as a non root user
3) change the /etc/prometheus/files.d/cerberus-cluster.json (or file(s) like it) on the prometheus instances to add or remove nodes as they come and go from the cerberus cluster.
When news nodes come into the cluster(s) we need to be able to add them to the monitoring stack- write an ansible role/playbook to do that.
The ansible code should:
1) install prometheus, node-exporter, dgcm (if its a compute node) and promtail 2) config each service to run as a systemd job as a non root user 3) change the /etc/prometheus/files.d/cerberus-cluster.json (or file(s) like it) on the prometheus instances to add or remove nodes as they come and go from the cerberus cluster.