BCDevOps / developer-experience

This repository is used to track all work for the BCGov Platform Services Team (This includes work for: 1. Platform Experience, 2. Developer Experience 3. Platform Operations/OCP 3)
Apache License 2.0
8 stars 17 forks source link

Configure Vault for additional metrics collection #5215

Open IanKWatts opened 1 month ago

IanKWatts commented 1 month ago

Describe the issue Vault can provide detailed metrics to Prometheus/Sysdig that will allow us to improve the monitoring and alerting for this service, but it must be configured for it. Configure the Vault server StatefulSet according to the Hashicorp documentation to provide the additional metrics recommended in the Maturity Assessment.

This will involve securing the metrics endpoint, either by using a Vault token or with a path-based route. Prefer option 1.

What is the Value/Impact? Better monitoring and alerting = better reliability

What is the plan? How will this get completed? Configure the Vault server according to the docs.

Identify any dependencies n/a

Definition of done

IanKWatts commented 3 weeks ago

Key metrics: https://developer.hashicorp.com/vault/docs/internals/telemetry/key-metrics#resource-metrics Telemetry: https://developer.hashicorp.com/vault/docs/configuration/telemetry