confluentinc / jmx-monitoring-stacks

📊 Monitoring examples for Confluent Cloud and Confluent Platform
Apache License 2.0
51 stars 167 forks source link

Request to add Alerts examples for Client side metrics #241

Open MosheBlumbergX opened 3 months ago

MosheBlumbergX commented 3 months ago

As a customer I would like to see additional examples for client metric alerts and thresholds.

Very much like what has been outlined here:

https://github.com/confluentinc/jmx-monitoring-stacks/tree/main/jmxexporter-prometheus-grafana/assets/prometheus/prometheus-alerts

Thanks, Moshe.

hifly81 commented 3 months ago

@MosheBlumbergX I was wondering what metrics should be alerted.

For producers: kafka.producer:type=producer-metrics,client-id="{client-id}" (record-error-rate)

For consumers: kafka.consumer:type=consumer-fetch-manager-metrics,client-id="{client-id}" (records-lag-max)

kafka.consumer:type=consumer-coordinator-metrics,client-id="{client-id}" (sync-rate)

any other ideas?