m-lab / scraper

Scrape experiment data off of MLab nodes and upload it to Google Cloud Storage
Apache License 2.0
5 stars 5 forks source link

ClusterDown #245

Closed measurementlab closed 6 years ago

measurementlab commented 6 years ago

Alertmanager URL: http://status.mlab-oti.measurementlab.net:9093

TODO: add graph url from annotations.

stephen-soltesz commented 6 years ago

This was due to a cluster node auto-upgrade! 1.7.11 -> 1.8.6 -- the sensitivity of ClusterDown alert should be adjusted. e.g. from 3m to 10m to prevent the false positive.

Created: https://github.com/m-lab/prometheus-support/pull/180 to fix.

http://35.188.22.107:9090/graph?g0.range_input=6h&g0.end_input=2018-01-25+14%3A40&g0.step_input=120&g0.stacked=0&g0.expr=rate(process_cpu_seconds_total%7Bcontainer%3D%22prometheus%22%7D%5B5m%5D)&g0.tab=0&g1.range_input=6h&g1.end_input=2018-01-25+14%3A40&g1.expr=kube_node_info%7Bcluster%3D%22scraper-cluster-prometheus-pool%22%2C+node%3D~%22gke-scraper-cluster-prometheus-pool.*%22%7D%09&g1.tab=0