Open jchristgit opened 7 months ago
As discussed in the dev-ops channel, I think we can reach a configuration here that uses our existing high-availability Alertmanager setup.
We can set up token access so that Prometheus on the Ansible machines can push alerts through to the HA Alertmanager in Kubernetes.
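For illustration, the Prometheus side of this could look roughly like the fragment below. This is only a sketch: the hostname `alertmanager.example.org` and the token path are placeholders I made up, and it assumes the Kubernetes Alertmanager is exposed over HTTPS behind something that checks a bearer token.

```yaml
# prometheus.yml on the Ansible-managed host (sketch, not our actual config)
alerting:
  alertmanagers:
    - scheme: https
      # Bearer token read from a file so it stays out of the config itself.
      # The path is a placeholder.
      authorization:
        type: Bearer
        credentials_file: /etc/prometheus/alertmanager-token
      static_configs:
        - targets:
            # Placeholder hostname for the Kubernetes Alertmanager endpoint.
            - alertmanager.example.org
```

Reading the token from `credentials_file` rather than inlining `credentials` means the secret can be templated separately from the rest of the file.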
Some notes:
To clarify, following a discussion on Discord: this is about adding a "dead man's switch" alert that will route to Discord if the Netcup Prometheus instance can't contact the Alertmanager in Kubernetes properly. To cover this case we want to:
We need to configure our Alertmanager to send alerts to Discord so that we are informed whenever something is not right, as part of the monitoring setup on lovelace.
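A rough sketch of what the two pieces might look like, in case it helps the discussion. Both fragments are assumptions on my part rather than anything already agreed: the alert name `Watchdog`, the group and receiver names, and the webhook URL are placeholders, and `discord_configs` needs an Alertmanager version that ships the Discord integration.

A common way to build a dead man's switch is an alert that always fires, so that its absence signals a broken path between Prometheus and Alertmanager:

```yaml
# Prometheus rule file (sketch): an alert that is always firing.
groups:
  - name: meta
    rules:
      - alert: Watchdog
        # vector(1) always returns a sample, so this alert never resolves
        # while Prometheus is evaluating rules.
        expr: vector(1)
        labels:
          severity: none
        annotations:
          summary: Heartbeat alert used to verify the alerting pipeline.
```

And a minimal Alertmanager receiver that sends notifications to a Discord webhook:

```yaml
# alertmanager.yml (sketch): route everything to a Discord receiver.
route:
  receiver: discord
  group_by: [alertname]

receivers:
  - name: discord
    discord_configs:
      # Placeholder; the real webhook URL is a secret and should not be committed.
      - webhook_url: https://discord.com/api/webhooks/<id>/<token>
        send_resolved: true
```

Note that Alertmanager only notifies on alerts it receives, so something on the receiving side still has to notice when the `Watchdog` heartbeat stops arriving. That part is not covered by this sketch.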