thomas-krenn / check_ipmi_sensor_v3

Monitoring plugin to check IPMI sensors
https://www.thomas-krenn.com/en/wiki/IPMI_Sensor_Monitoring_Plugin
GNU General Public License v3.0

Feature Request - Report "OK" When Timeout #23

Closed wright546 closed 6 years ago

wright546 commented 6 years ago

The connection times out on several of the remote servers I monitor, which isn't a problem in itself. The problem is that some of those servers have failed hardware components, and the replacement hardware often takes a few days to arrive. In the meantime I "Acknowledge" the problem in Icinga2. But when a connection times out, Icinga marks the check as "Warning", which resets the acknowledgement. When the check successfully connects again, it re-detects the same hardware failure. This happens multiple times a day, and I have to "Acknowledge" the error over and over.

The only way I can think of to deal with this is to have check_ipmi_sensor report "OK" when the connection times out. Is this possible?
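To make the idea concrete, here is a rough sketch of a wrapper that runs the plugin and rewrites a timeout result to "OK". This is not an existing feature of check_ipmi_sensor. The names `remap_status` and `run_wrapped` are hypothetical, and the timeout pattern is an assumption, since the exact text the plugin prints on a timeout would need to be checked against real output.

```python
import re
import subprocess

# Standard monitoring plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3

# ASSUMPTION: the plugin's timeout message contains something like
# "timeout" or "timed out". Adjust after looking at real output.
TIMEOUT_PATTERN = re.compile(r"timed?[ -]?out", re.IGNORECASE)


def remap_status(exit_code, output, pattern=TIMEOUT_PATTERN):
    """Return (exit_code, output), reporting OK if the output looks like a timeout."""
    if exit_code != OK and pattern.search(output):
        return OK, "OK - timeout ignored: " + output.strip()
    return exit_code, output


def run_wrapped(argv):
    """Run the plugin command given in argv and return the remapped exit code."""
    if not argv:
        print("UNKNOWN - no plugin command given")
        return UNKNOWN
    proc = subprocess.run(argv, capture_output=True, text=True)
    code, text = remap_status(proc.returncode, proc.stdout + proc.stderr)
    print(text.strip())
    return code
```

A script using this would end with `sys.exit(run_wrapped(sys.argv[1:]))` and be configured as the check command in place of the plugin, with the original plugin command line passed as arguments. One caveat: an "OK" result is itself a state change, so whether this actually keeps the acknowledgement in place would depend on how Icinga2 handles recoveries.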

Is there a better way to deal with this problem that I'm not thinking of?

Thanks for any help you can give.