concourse / hush-house

Concourse k8s-based environment
https://hush-house.pivotal.io

metrics: node-pool error panel #112

Open cirocosta opened 4 years ago

cirocosta commented 4 years ago

We have an error panel on the Node dashboard that shows, for a specific worker, the rate at which messages are being written to the kernel message ring buffer.

https://github.com/concourse/hush-house/blob/4382b2a01bd5cd26cab9df4022c23819eef1d41d/deployments/with-creds/metrics/dashboards/concourse/node.json#L2969-L2975

[Screenshot, 2020-01-16]

It'd be great to have a similar panel under Node Pools that highlights possible errors across an entire node pool.
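
As a rough illustration of the aggregation such a panel would need, here is a minimal Python sketch that rolls per-node message rates up into per-pool totals. The node names, pool names, sample rates, and the `aggregate_by_pool` helper are all hypothetical and not taken from the existing dashboard; in practice this would probably be expressed in the dashboard's query rather than in code.

```python
from collections import defaultdict


def aggregate_by_pool(node_rates, node_to_pool):
    """Sum per-node kernel message rates into one total per node pool.

    node_rates:   mapping of node name -> messages per second (hypothetical data)
    node_to_pool: mapping of node name -> node pool name (hypothetical data)
    Nodes without a known pool are grouped under "unknown".
    """
    pool_rates = defaultdict(float)
    for node, rate in node_rates.items():
        pool = node_to_pool.get(node, "unknown")
        pool_rates[pool] += rate
    return dict(pool_rates)


# Hypothetical sample: two nodes in a "workers" pool, one in a "generic" pool.
node_rates = {"worker-0": 0.5, "worker-1": 1.5, "generic-0": 0.0}
node_to_pool = {
    "worker-0": "workers",
    "worker-1": "workers",
    "generic-0": "generic",
}

pool_rates = aggregate_by_pool(node_rates, node_to_pool)
print(pool_rates)
```

Summing treats a pool with one very noisy node the same as a pool where every node is slightly noisy; taking the maximum per pool instead would be another reasonable choice if the goal is to spot a single misbehaving node.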