xchem / xchem_it

Issues for XChem IT work
0 stars 0 forks source link

Deploy Prometheus for monitoring #1

Open tdudgeon opened 3 years ago

tdudgeon commented 3 years ago

Deploy Prometheus stack to DEV and PROD clusters to allow fine grain monitoring of the cluster.

tdudgeon commented 3 years ago

Prometheus is deployed to the dev cluster, along with persistent storage for Prometheus and Grafana (currently Cinder volumes, but we may switch to Longhorn). Data is retained for 5 days. We should monitor this for a few days to confirm it's all working correctly.

tdudgeon commented 3 years ago

Rancher 2.5 supports Prometheus in a very different way to older versions. This required Prometheus to be uninstalled and then re-installed using the new mechanism (Cluster Explorer -> Apps & Marketplace -> Monitoring).

This was done for the dev cluster on 19-APR-2021.

To see metrics in Lens set the prometheus service to cattle-monitoring-system/rancher-monitoring-prometheus:9090. Not all metrics are visible in Lens.