Open ebensom opened 5 months ago
@ebensom : can we please quickly sync about it? We have open questions
@ebensom : I will setup a call for it to clarify the purpose. We want avoid to increase load on Gardener caused by redundant health-checks from us + additional monitoring etc.
Istio offers HTTP request metrics but those metrics are only available if traffic is used via plain HTTP but for Gardener HTTP connections are not possible as it enforces HTTPS communication.
Option to implement a check via Prometheus client in KCP would be possible, but this won't reflect whether the KIM is really able to talk to Gardener (it's not reflecting the truth).
@ebensom : we will only implement it if you send to each of us a "Thank you " award ;)
Description
Implement periodic health checking of Gardener cluster API dependency by periodically querying of the version or health non-resource endpoint via gardener kubeclient in a separate goroutine and keep the latest check result up-to-date. Expose the current (up-to-date) healthcheck result on the Prometheus metrics endpoint via series like:
Reasons
Ability to cross-correlate infrastructure-manager errors with Gardener API (dependency) errors.
Attachments