Open mseiwald opened 4 years ago
I think there's a tool called postgres-exporter (maybe this one) which some use together with the operator by defining a sidecar.
@FxKu From what I understand postgres-exporter is for Database (PostgreSQL) metrics. I was talking about operator metrics (e.g. failed reconciliations etc.).
I think the most relevant data is actually exposed on the target objects and the manifest status; combine this with watching for errors.
We will look into the log reporting and the log levels used, so that errors are really errors. That is not entirely easy to decide, given our resync and transient problems.
We also do not currently use, or plan to use, Prometheus to monitor the operator, so that would need to come as a contribution.
@mseiwald I have created a little PR with a Prometheus endpoint to check PG cluster sync status.
@FxKu, could you give some insight into how we could get operator metrics (e.g. failed reconciliations etc.)? My use case: I want metrics on whether clusters are successfully provisioned and what states they are in.
It would be great to have a Prometheus metrics endpoint available in postgres-operator to be able to add monitoring at the operator level (not the DBs themselves). An example would be when the operator fails to sync a DB for whatever reason. Currently I don't see a way to be notified about these events except parsing the operator's logs.
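Until something like that exists in the operator, the manifest-status approach mentioned earlier in the thread could be sketched roughly as below. This is only an illustration, not how the operator or the PR above does it. It assumes each `postgresql` resource (for example from `kubectl get postgresql -A -o json`) reports its state under `status.PostgresClusterStatus`, with values such as `Running` or `SyncFailed`; please verify the field name and values against your operator version. The metric name `postgres_cluster_status_count` and the function `render_cluster_states` are made up for this example.

```python
from collections import Counter


def render_cluster_states(items):
    """Render Prometheus text-format gauges counting clusters per status.

    `items` is a list of dicts shaped like postgresql custom resources.
    The status field name used here is an assumption, not a confirmed API.
    """
    counts = Counter()
    for item in items:
        status = item.get("status") or {}
        if isinstance(status, str):
            # Tolerate a plain-string status, in case some versions use one.
            state = status or "Unknown"
        else:
            state = status.get("PostgresClusterStatus") or "Unknown"
        namespace = (item.get("metadata") or {}).get("namespace", "")
        counts[(namespace, state)] += 1

    lines = [
        # Hypothetical metric name, chosen for this sketch only.
        "# HELP postgres_cluster_status_count Number of postgresql resources per reported status.",
        "# TYPE postgres_cluster_status_count gauge",
    ]
    for (namespace, state), n in sorted(counts.items()):
        lines.append(
            f'postgres_cluster_status_count{{namespace="{namespace}",status="{state}"}} {n}'
        )
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    sample = [
        {"metadata": {"namespace": "default", "name": "a"},
         "status": {"PostgresClusterStatus": "Running"}},
        {"metadata": {"namespace": "default", "name": "b"},
         "status": {"PostgresClusterStatus": "SyncFailed"}},
    ]
    print(render_cluster_states(sample), end="")
```

The output could be served from a small HTTP handler or written to a file for a textfile collector, and an alert on the count of clusters in a failed state would then replace parsing the operator's logs for this particular case. It would not cover operator-internal metrics such as reconciliation counts.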