sensu / catalog

Monitoring as code for Sensu Go. "There's a template for that!"
8 stars 5 forks source link

Feature request: Internal Sensu, etcd and Postgres monitoring, and alert integration #336

Open jhenderson-pro opened 1 year ago

jhenderson-pro commented 1 year ago

Feature Suggestions

As a Sensu operator, it would be helpful to have a one-stop shop catalog integration to monitor Sensu with Sensu, Including Sensu backends, embedded or external etcd, and Postgres health monitoring and alerts.

Context

Working with a customer, they brought up the point that we do not currently have complete integration to 'monitor Sensu with Sensu.' However, we currently have pieces of this to roll your own and accomplish the same result, but they're requesting an integration to have them rolled together.

portertech commented 1 year ago

What documentation do we have around monitoring sensu with sensu?

portertech commented 1 year ago

Working on requirements.

We have documented monitoring sensu with sensu, using /health. https://docs.sensu.io/sensu-go/latest/operations/monitor-sensu/monitor-sensu-with-sensu/

CE stated that there is a gap in regards to monitoring etcd, we need one or more key metrics to monitor.

portertech commented 1 year ago

We will need a big disclaimer that a sensu cluster cannot monitoring itself, it needs a partner (another cluster).

portertech commented 1 year ago

@jhenderson-pro please add customer etcd issue context.

jhenderson-pro commented 1 year ago

Initially, a customer had an issue with unbounded etcd growth (internal ref in the CE priority queue). This had a significant impact on Sensu's health. This feature is requested to facilitate simpler configuring of monitoring and alerts for critical Sensu components in an integration with the goal of some warning before the environment becomes unresponsive. Initially, internal or external etcd, Postgres, and the backends. Etcd disk quotas are a common support request and may be good to include in an integration.