cloudoperators / greenhouse-extensions

Extensions for Greenhouse, the cloud operations platform
Apache License 2.0

[kube-monitoring] Define default rules #120

Closed by richardtief 2 weeks ago

richardtief commented 3 months ago

Context

kube-monitoring ships with a standard set of alerting rules from the kubernetes-mixin project. These rules have very high formal quality but are also very generic. In addition, there is a proven set of alerting rules from SAP Converged Cloud [1], which is more action-oriented and causes less noise. To get the best out of both rule sets, they need to be actively reviewed.

[1] https://github.com/sapcc/helm-charts/tree/master/prometheus-rules/prometheus-kubernetes-rules

Tasks

  1. Improve the formal quality of the SAP Converged Cloud prometheus-kubernetes-rules.
  2. Create a separate GitHub repository holding the new rules along with the playbooks.

richardtief commented 1 month ago

We created a dedicated repository to collect common Prometheus alerting rules and playbooks: https://github.com/cloudoperators/kubernetes-operations

richardtief commented 2 weeks ago

All rules have been reviewed and polished: https://github.com/cloudoperators/kubernetes-operations/commit/f4f6577f8d94865db1a7a9b35d9075f02ce8a2e6