Closed jbnjohnathan closed 11 months ago
Discussed on the Community call on 19.10.2023: The Prometheus Alertmanager rules are provided as examples. You can modify them in any way, or remove or disable a rule if you want.
Discussed on the Community call on 19.10.2023 for the second time: After further discussion, it seems it does not make sense to exclude only __consumer_offsets and not, for example, the transaction state topic. We should either exclude all of Kafka's internal topics (consumer offsets, transaction state) or keep all of them included. @mimaison will have a look at it.
The other topics created by the Topic Operator or Cruise Control are regular topics like any other. Whether a lack of messages on them should raise an alert depends on the exact use case and situation, as with any other topic. So they should not be excluded.
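The "exclude all of Kafka's internal topics" option could be sketched as a rule like the one below. This is only an illustration, not the rule shipped in the Strimzi examples: the alert name, the `kafka_log_log_size` metric, the 10-minute window, and the severity label are assumptions, and `__transaction_state` is used as the name of Kafka's transaction state topic. PromQL regex matchers are fully anchored, so the matcher excludes exactly the two listed topics and leaves the Topic Operator and Cruise Control topics included.

```yaml
# Sketch only: alert name, metric name, window, and severity are assumptions,
# not copied from the Strimzi example rules.
groups:
  - name: kafka-topic-activity
    rules:
      - alert: NoMessageForTooLong
        # Exclude both Kafka internal topics instead of only __consumer_offsets.
        expr: changes(kafka_log_log_size{topic!~"__consumer_offsets|__transaction_state"}[10m]) == 0
        for: 10m
        labels:
          severity: warning
        annotations:
          summary: 'No messages on topic {{ $labels.topic }} for 10 minutes'
```

Keeping all internal topics included instead would mean dropping the `topic!~` matcher entirely.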
Bug Description
When applying the default Alertmanager rules from https://github.com/strimzi/strimzi-kafka-operator/blob/main/examples/metrics/prometheus-install/prometheus-rules.yaml, there are a lot of false positives from the topics created by Strimzi. For example:
Steps to reproduce
Expected behavior
If some built-in topics are not expected to be written to regularly, they should be excluded from the Prometheus rules, just like the topics matching __consumer_offsets are now.
Strimzi version
0.35.1
Kubernetes version
v1.25.4
Installation method
Helm
Infrastructure
OpenShift
Configuration files and logs
No response
Additional context
No response