hmcts / roadmap-platform-operations

0 stars 0 forks source link

Configure Prometheus monitoring on AKS System Components #2228

Open hmcts-platform-operations opened 1 month ago

hmcts-platform-operations commented 1 month ago

EI-2259

Summary

Crime AKS clusters have used Prometheus as part of the initial build. Prometheus monitoring has been reduced to only a small subset of metrics to provide only scaling metrics for use with HPA.

Due to cost consideration we weren't allowed to use either Azure Monitor or Dynatrace for monitoring of AKS System Services in all environments. This creates blind spots and create duplication if we use different monitoring tools for different environments.

We would like to adopt Prometheus monitoring and Alerting for System Components of the AKS Clusters so we can keep the same consistent mechanism and configs across all environments

Intended Outcome

AKS System Components consistently monitored across all environment using Prometheus

Impact on Teams

How will this impact teams, put 'No impact' if none