update-operator is a long-running Go process that supervises complex cluster-wide operations. As such, it should expose metrics about its status that Prometheus can scrape and alert on. Access to this endpoint should be governed by Kubernetes RBAC policies.
This is a preliminary list of interesting metrics:
reboot-needed
statebefore-reboot
stateafter-reboot
state