bosun-monitor / bosun

Time Series Alerting Framework
http://bosun.org
MIT License

Error trap collectors to prevent crashing the entire process #1511

Closed floyd-may closed 4 years ago

floyd-may commented 8 years ago

(spun off from #1510)

I noticed that a crashing elasticsearch collector was taking down the entire scollector process. What tipped me off was that I stopped getting hbase garbage collection stats.

If a collector can bring down the entire process, it puts collection of other stats in jeopardy. Is it possible to trap errors per-collector so they don't bring down stat collection for other collectors?

christoffbotha commented 8 years ago

+1. I also experienced this crash after I upgraded to elasticsearch 2. I know version 2 is not supported yet, but we need a way to trap individual collector panics like this so they do not kill scollector.

stale[bot] commented 4 years ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.