EI-2259
Summary
Crime AKS clusters have included Prometheus since the initial build. Prometheus monitoring has since been reduced to a small subset of metrics, providing only the scaling metrics used by the HPA.
Due to cost considerations, we were not allowed to use either Azure Monitor or Dynatrace to monitor AKS System Services in all environments. This creates blind spots, and using different monitoring tools for different environments would create duplication.
We would like to adopt Prometheus monitoring and alerting for the System Components of the AKS clusters so that we keep one consistent mechanism and configuration across all environments.
Intended Outcome
AKS System Components consistently monitored across all environments using Prometheus
Impact on Teams
How will this impact teams? Put 'No impact' if none.