thomas-krenn / check_ipmi_sensor_v3

Monitoring plugin to check IPMI sensors
https://www.thomas-krenn.com/en/wiki/IPMI_Sensor_Monitoring_Plugin
GNU General Public License v3.0
54 stars 21 forks source link

Plugin throws WARN when first FAN does not start with "1" #16

Closed ronator closed 6 years ago

ronator commented 7 years ago

Hi,

I monitor some HP ILO interfaces with this plugin and help of freeipmi in icinga2. It works great!

Today, I tried to implement this check for a different HP server and got a FAN Warning:

IPMI Status: Warning [Fan = Warning]

I found out, that this system has only 4 fans configured: 'Fan 3' up to 'Fan 6'. I tried to set '-fc 4' but that causes still a warning. I guess, the plugin expects fan numbers to start with '1'.

We do not plan to add 'Fan 1' and 'Fan 2' and the system is running just fine. So my question is:

Is or will there be a way to set something like a 'fan offset" so that the plugin will not warn if fan numbers start with something else than '1'? I am assuming that's causing the warning, but not 100% sure.

/usr/lib/nagios/plugins/check_ipmi_sensor -H 192.168.xxx.xxx -f /etc/freeipmi/icinga.conf -fc 4

IPMI Status: Warning [Fan = Warning]

Thanks in advance.

tk-wfischer commented 7 years ago

Hi ronator,

the -fc option only adds another option to verify the number of active fans: number of fans that should be active. If the number of current active fans reported by IPMI is smaller than then a Warning state is returned.

Please post the verbose output (-vvv option), than I can give you further hints how to solve your issue. I think you will need to add the -x option to exclude sensors for Fan 1 and/or Fan 2. But with the verbose output I can give you a concrete detailed information :-)

ronator commented 7 years ago

Hi and yes, let me check that right away.

ronator commented 7 years ago

I did not know you can exclude some fans with -x.

/usr/lib/nagios/plugins/check_ipmi_sensor -H 192.168.x.x -f /etc/freeipmi/icinga.conf -fc 4 -vvv

The verbose output is the following:

------------- debug output for sel (-vvv is set): ------------
  /usr/sbin/ipmi-sel was executed with the following parameters:
    /usr/sbin/ipmi-sel -h 192.168.x.x --config-file /etc/freeipmi/icinga.conf --driver-type=LAN_2_0 --output-event-state --interpret-oem-data --entity-sensor-names --sensor-types=all --exclude-sensor-types=
  output of FreeIPMI:
ID | Date        | Time     | Name                                             | Type         | State    | Event
5  | Jan-26-2015 | 10:31:28 | Sensor #0                                        | OS Critical Stop | Warning  | OS Graceful Shutdown
6  | Jan-26-2015 | 10:31:28 | N/A                                              | N/A              | N/A      | OEM defined = 00h 03h 00h 02h 80h 00h
7  | Jan-26-2015 | 10:34:00 | Sensor #0                                        | OS Boot          | Nominal  | C: boot completed
8  | Jan-26-2015 | 10:34:00 | N/A                                              | N/A              | N/A      | OEM defined = 00h FFh 09h C6h 54h 00h
9  | Jan-26-2015 | 10:34:41 | Sensor #0                                        | OS Critical Stop | Warning  | OS Graceful Shutdown
10 | Jan-26-2015 | 10:34:41 | N/A                                              | N/A              | N/A      | OEM defined = 00h 03h 00h 02h 80h 00h
11 | Jan-26-2015 | 10:36:55 | Sensor #0                                        | OS Boot          | Nominal  | C: boot completed
12 | Jan-26-2015 | 10:36:55 | N/A                                              | N/A              | N/A      | OEM defined = 00h B5h 0Ah C6h 54h 00h
13 | Jan-26-2015 | 10:40:59 | Sensor #0                                        | OS Critical Stop | Warning  | OS Graceful Shutdown
14 | Jan-26-2015 | 10:40:59 | N/A                                              | N/A              | N/A      | OEM defined = 00h FFh 00h 05h 00h 00h
15 | Jan-26-2015 | 10:43:17 | Sensor #0                                        | OS Boot          | Nominal  | C: boot completed
16 | Jan-26-2015 | 10:43:17 | N/A                                              | N/A              | N/A      | OEM defined = 00h 33h 0Ch C6h 54h 00h
17 | Jan-26-2015 | 10:45:48 | Sensor #0                                        | OS Critical Stop | Warning  | OS Graceful Shutdown
18 | Jan-26-2015 | 10:45:48 | N/A                                              | N/A              | N/A      | OEM defined = 00h 02h 00h 04h 84h 00h
19 | Jan-26-2015 | 10:48:09 | Sensor #0                                        | OS Boot          | Nominal  | C: boot completed
20 | Jan-26-2015 | 10:48:09 | N/A                                              | N/A              | N/A      | OEM defined = 00h 55h 0Dh C6h 54h 00h
21 | Jan-26-2015 | 10:48:35 | Sensor #0                                        | OS Critical Stop | Warning  | OS Graceful Shutdown
22 | Jan-26-2015 | 10:48:35 | N/A                                              | N/A              | N/A      | OEM defined = 00h 03h 00h 02h 80h 00h
23 | Jan-26-2015 | 10:50:56 | Sensor #0                                        | OS Boot          | Nominal  | C: boot completed
24 | Jan-26-2015 | 10:50:56 | N/A                                              | N/A              | N/A      | OEM defined = 00h FDh 0Dh C6h 54h 00h
25 | Jan-26-2015 | 11:49:44 | Sensor #0                                        | OS Critical Stop | Warning  | OS Graceful Shutdown
26 | Jan-26-2015 | 11:49:44 | N/A                                              | N/A              | N/A      | OEM defined = 00h 01h 00h 01h 80h 00h
27 | Jan-26-2015 | 11:49:44 | N/A                                              | N/A              | N/A      | OEM defined = 01h 52h 00h 65h 00h 00h
28 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 02h 62h 00h 6Fh 00h 00h
29 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 03h 6Fh 00h 74h 00h 00h
30 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 04h 69h 00h 6Eh 00h 00h
31 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 05h 67h 00h 20h 00h 00h
32 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 06h 61h 00h 74h 00h 00h
33 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 07h 20h 00h 48h 00h 00h
34 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 08h 50h 00h 20h 00h 00h
35 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 09h 53h 00h 55h 00h 00h
36 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 0Ah 4Dh 00h 20h 00h 00h
37 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 0Bh 75h 00h 73h 00h 00h
38 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 0Ch 65h 00h 72h 00h 00h
39 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 0Dh 20h 00h 72h 00h 00h
40 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 0Eh 65h 00h 71h 00h 00h
41 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 0Fh 75h 00h 65h 00h 00h
42 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 10h 73h 00h 74h 00h 00h
43 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 11h 20h 00h 69h 00h 00h
44 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 12h 6Eh 00h 20h 00h 00h
45 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 13h 6Fh 00h 72h 00h 00h
46 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 14h 64h 00h 65h 00h 00h
47 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 15h 72h 00h 20h 00h 00h
48 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 16h 74h 00h 6Fh 00h 00h
49 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 17h 20h 00h 61h 00h 00h
50 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 18h 63h 00h 74h 00h 00h
51 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 19h 69h 00h 76h 00h 00h
52 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 1Ah 61h 00h 74h 00h 00h
53 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 1Bh 65h 00h 20h 00h 00h
54 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 1Ch 64h 00h 65h 00h 00h
55 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 1Dh 70h 00h 6Ch 00h 00h
56 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 1Eh 6Fh 00h 79h 00h 00h
57 | Jan-26-2015 | 11:49:46 | N/A                                              | N/A              | N/A      | OEM defined = 1Fh 65h 00h 64h 00h 00h
58 | Jan-26-2015 | 11:49:46 | N/A                                              | N/A              | N/A      | OEM defined = 20h 20h 00h 66h 00h 00h
59 | Jan-26-2015 | 11:49:46 | N/A                                              | N/A              | N/A      | OEM defined = 21h 69h 00h 72h 00h 00h
60 | Jan-26-2015 | 11:49:46 | N/A                                              | N/A              | N/A      | OEM defined = 22h 6Dh 00h 77h 00h 00h
61 | Jan-26-2015 | 11:49:46 | N/A                                              | N/A              | N/A      | OEM defined = 23h 61h 00h 72h 00h 00h
62 | Jan-26-2015 | 11:49:46 | N/A                                              | N/A              | N/A      | OEM defined = 24h 65h 00h 20h 00h 00h
63 | Jan-26-2015 | 11:49:46 | N/A                                              | N/A              | N/A      | OEM defined = 25h 6Fh 00h 72h 00h 00h
64 | Jan-26-2015 | 11:49:46 | N/A                                              | N/A              | N/A      | OEM defined = 26h 20h 00h 73h 00h 00h
65 | Jun-29-2017 | 03:20:21 | Sensor #0                                        | OS Critical Stop | Warning  | OS Graceful Shutdown
66 | Jun-29-2017 | 03:20:21 | N/A                                              | N/A              | N/A      | OEM defined = 00h 02h 00h 02h 80h 00h
67 | Jun-29-2017 | 03:22:53 | Sensor #0                                        | OS Boot          | Nominal  | C: boot completed
68 | Jun-29-2017 | 03:22:53 | N/A                                              | N/A              | N/A      | OEM defined = 00h 71h 64h 54h 59h 00h
------------- debug output for sensors (-vvv is set): ------------
  script was executed with the following parameters:
    /usr/lib/nagios/plugins/check_ipmi_sensor -H 192.168.x.x -f /etc/freeipmi/icinga.conf -fc 4 -vvv
  check_ipmi_sensor version:
    3.12
  FreeIPMI version:
    ipmi-sensors - 1.1.5
  FreeIPMI was executed with the following parameters:
    /usr/sbin/ipmi-sensors -h 192.168.x.x --config-file /etc/freeipmi/icinga.conf --quiet-cache --sdr-cache-recreate --interpret-oem-data --output-sensor-state --ignore-not-available-sensors --driver-type=LAN_2_0
  FreeIPMI return code: 0
  output of FreeIPMI:
ID | Name             | Type         | State    | Reading    | Units | Event
0  | UID Light        | OEM Reserved | N/A      | N/A        | N/A   | 'OEM Event = 0001h'
2  | 01-Inlet Ambient | Temperature  | Nominal  | 14.00      | C     | 'OK'
3  | 02-CPU 1         | Temperature  | Nominal  | 40.00      | C     | 'OK'
5  | 04-P1 DIMM 1-3   | Temperature  | Nominal  | 24.00      | C     | 'OK'
6  | 05-P1 DIMM 4-6   | Temperature  | Nominal  | 24.00      | C     | 'OK'
7  | 06-P1 DIMM 7-9   | Temperature  | Nominal  | 23.00      | C     | 'OK'
8  | 07-P1 DIMM 10-12 | Temperature  | Nominal  | 24.00      | C     | 'OK'
13 | 12-HD Max        | Temperature  | Nominal  | 35.00      | C     | 'OK'
14 | 13-Chipset       | Temperature  | Nominal  | 44.00      | C     | 'OK'
15 | 14-P/S 1         | Temperature  | Nominal  | 20.00      | C     | 'OK'
16 | 15-P/S 2         | Temperature  | Nominal  | 18.00      | C     | 'OK'
17 | 16-P/S 2 Zone    | Temperature  | Nominal  | 20.00      | C     | 'OK'
18 | 17-VR P1         | Temperature  | Nominal  | 26.00      | C     | 'OK'
19 | 18-VR P2         | Temperature  | Nominal  | 22.00      | C     | 'OK'
20 | 19-VR P1 Mem     | Temperature  | Nominal  | 23.00      | C     | 'OK'
21 | 20-VR P1 Mem     | Temperature  | Nominal  | 26.00      | C     | 'OK'
22 | 21-VR P2 Mem     | Temperature  | Nominal  | 22.00      | C     | 'OK'
23 | 22-VR P2 Mem     | Temperature  | Nominal  | 23.00      | C     | 'OK'
24 | 23-VR P1Vtt Zone | Temperature  | Nominal  | 25.00      | C     | 'OK'
25 | 24-VR P2Vtt Zone | Temperature  | Nominal  | 20.00      | C     | 'OK'
26 | 25-HD Controller | Temperature  | Nominal  | 61.00      | C     | 'OK'
27 | 26-iLO Zone      | Temperature  | Nominal  | 29.00      | C     | 'OK'
35 | 34-PCI 1 Zone    | Temperature  | Nominal  | 21.00      | C     | 'OK'
36 | 35-PCI 2 Zone    | Temperature  | Nominal  | 22.00      | C     | 'OK'
37 | 36-PCI 3 Zone    | Temperature  | Nominal  | 22.00      | C     | 'OK'
41 | 40-I/O Board 1   | Temperature  | Nominal  | 25.00      | C     | 'OK'
43 | 42-VR P1 Zone    | Temperature  | Nominal  | 21.00      | C     | 'OK'
44 | 43-BIOS Zone     | Temperature  | Nominal  | 36.00      | C     | 'OK'
45 | 44-System Board  | Temperature  | Nominal  | 26.00      | C     | 'OK'
46 | 45-SuperCap Max  | Temperature  | Nominal  | 16.00      | C     | 'OK'
47 | 46-Chipset Zone  | Temperature  | Nominal  | 27.00      | C     | 'OK'
48 | 47-Battery Zone  | Temperature  | Nominal  | 25.00      | C     | 'OK'
49 | 48-I/O Zone      | Temperature  | Nominal  | 28.00      | C     | 'OK'
50 | 49-Sys Exhaust   | Temperature  | Nominal  | 26.00      | C     | 'OK'
51 | 50-Sys Exhaust   | Temperature  | Nominal  | 27.00      | C     | 'OK'
54 | Fan 3            | Fan          | Nominal  | N/A        | N/A   | 'transition to Running'
55 | Fan 4            | Fan          | Nominal  | N/A        | N/A   | 'transition to Running'
56 | Fan 5            | Fan          | Nominal  | N/A        | N/A   | 'transition to Running'
57 | Fan 6            | Fan          | Nominal  | N/A        | N/A   | 'transition to Running'
58 | Power Supply 1   | Power Supply | Nominal  | N/A        | N/A   | 'Presence detected'
59 | Power Supply 2   | Power Supply | Nominal  | N/A        | N/A   | 'Presence detected'
60 | Power Meter      | Current      | N/A      | N/A        | N/A   | 'Device Enabled'
61 | Power Supplies   | Power Supply | Nominal  | N/A        | N/A   | 'Fully Redundant'
63 | Fans             | Fan          | Nominal  | N/A        | N/A   | 'Fully Redundant'
65 | Memory           | Memory       | Nominal  | N/A        | N/A   | 'Presence detected'
66 | C1 P1I Bay 1     | Drive Slot   | Nominal  | N/A        | N/A   | 'Drive Presence'
67 | C1 P1I Bay 2     | Drive Slot   | Nominal  | N/A        | N/A   | 'Drive Presence'
68 | C1 P1I Bay 3     | Drive Slot   | Nominal  | N/A        | N/A   | 'Drive Presence'
69 | C1 P1I Bay 4     | Drive Slot   | Nominal  | N/A        | N/A   | 'Drive Presence'
70 | C1 P2I Bay 5     | Drive Slot   | Nominal  | N/A        | N/A   | 'Drive Presence'
71 | C1 P2I Bay 6     | Drive Slot   | Nominal  | N/A        | N/A   | 'Drive Presence'

--------------------- end of debug output ---------------------

(long line!)

IPMI Status: Critical [58 system event log (SEL) entries present - details: (Sensor #0 = Warning, OS Critical Stop, OS Graceful Shutdown), (N/A = N/A, N/A, OEM defined = 00h 03h 00h 02h 80h 00h), (N/A = N/A, N/A, OEM defined = 00h FFh 09h C6h 54h 00h), (Sensor #0 = Warning, OS Critical Stop, OS Graceful Shutdown), (N/A = N/A, N/A, OEM defined = 00h 03h 00h 02h 80h 00h), (N/A = N/A, N/A, OEM defined = 00h B5h 0Ah C6h 54h 00h), (Sensor #0 = Warning, OS Critical Stop, OS Graceful Shutdown), (N/A = N/A, N/A, OEM defined = 00h FFh 00h 05h 00h 00h), (N/A = N/A, N/A, OEM defined = 00h 33h 0Ch C6h 54h 00h), (Sensor #0 = Warning, OS Critical Stop, OS Graceful Shutdown), (N/A = N/A, N/A, OEM defined = 00h 02h 00h 04h 84h 00h), (N/A = N/A, N/A, OEM defined = 00h 55h 0Dh C6h 54h 00h), (Sensor #0 = Warning, OS Critical Stop, OS Graceful Shutdown), (N/A = N/A, N/A, OEM defined = 00h 03h 00h 02h 80h 00h), (N/A = N/A, N/A, OEM defined = 00h FDh 0Dh C6h 54h 00h), (Sensor #0 = Warning, OS Critical Stop, OS Graceful Shutdown), (N/A = N/A, N/A, OEM defined = 00h 01h 00h 01h 80h 00h), (N/A = N/A, N/A, OEM defined = 01h 52h 00h 65h 00h 00h), (N/A = N/A, N/A, OEM defined = 02h 62h 00h 6Fh 00h 00h), (N/A = N/A, N/A, OEM defined = 03h 6Fh 00h 74h 00h 00h), (N/A = N/A, N/A, OEM defined = 04h 69h 00h 6Eh 00h 00h), (N/A = N/A, N/A, OEM defined = 05h 67h 00h 20h 00h 00h), (N/A = N/A, N/A, OEM defined = 06h 61h 00h 74h 00h 00h), (N/A = N/A, N/A, OEM defined = 07h 20h 00h 48h 00h 00h), (N/A = N/A, N/A, OEM defined = 08h 50h 00h 20h 00h 00h), (N/A = N/A, N/A, OEM defined = 09h 53h 00h 55h 00h 00h), (N/A = N/A, N/A, OEM defined = 0Ah 4Dh 00h 20h 00h 00h), (N/A = N/A, N/A, OEM defined = 0Bh 75h 00h 73h 00h 00h), (N/A = N/A, N/A, OEM defined = 0Ch 65h 00h 72h 00h 00h), (N/A = N/A, N/A, OEM defined = 0Dh 20h 00h 72h 00h 00h), (N/A = N/A, N/A, OEM defined = 0Eh 65h 00h 71h 00h 00h), (N/A = N/A, N/A, OEM defined = 0Fh 75h 00h 65h 00h 00h), (N/A = N/A, N/A, OEM defined = 10h 73h 00h 74h 00h 00h), (N/A = N/A, N/A, OEM defined = 11h 20h 00h 69h 00h 00h), (N/A = N/A, N/A, OEM defined = 12h 6Eh 00h 20h 00h 00h), (N/A = N/A, N/A, OEM defined = 13h 6Fh 00h 72h 00h 00h), (N/A = N/A, N/A, OEM defined = 14h 64h 00h 65h 00h 00h), (N/A = N/A, N/A, OEM defined = 15h 72h 00h 20h 00h 00h), (N/A = N/A, N/A, OEM defined = 16h 74h 00h 6Fh 00h 00h), (N/A = N/A, N/A, OEM defined = 17h 20h 00h 61h 00h 00h), (N/A = N/A, N/A, OEM defined = 18h 63h 00h 74h 00h 00h), (N/A = N/A, N/A, OEM defined = 19h 69h 00h 76h 00h 00h), (N/A = N/A, N/A, OEM defined = 1Ah 61h 00h 74h 00h 00h), (N/A = N/A, N/A, OEM defined = 1Bh 65h 00h 20h 00h 00h), (N/A = N/A, N/A, OEM defined = 1Ch 64h 00h 65h 00h 00h), (N/A = N/A, N/A, OEM defined = 1Dh 70h 00h 6Ch 00h 00h), (N/A = N/A, N/A, OEM defined = 1Eh 6Fh 00h 79h 00h 00h), (N/A = N/A, N/A, OEM defined = 1Fh 65h 00h 64h 00h 00h), (N/A = N/A, N/A, OEM defined = 20h 20h 00h 66h 00h 00h), (N/A = N/A, N/A, OEM defined = 21h 69h 00h 72h 00h 00h), (N/A = N/A, N/A, OEM defined = 22h 6Dh 00h 77h 00h 00h), (N/A = N/A, N/A, OEM defined = 23h 61h 00h 72h 00h 00h), (N/A = N/A, N/A, OEM defined = 24h 65h 00h 20h 00h 00h), (N/A = N/A, N/A, OEM defined = 25h 6Fh 00h 72h 00h 00h), (N/A = N/A, N/A, OEM defined = 26h 20h 00h 73h 00h 00h), (Sensor #0 = Warning, OS Critical Stop, OS Graceful Shutdown), (N/A = N/A, N/A, OEM defined = 00h 02h 00h 02h 80h 00h), (N/A = N/A, N/A, OEM defined = 00h 71h 64h 54h 59h 00h) - fix the reported issues and clear your SEL or exclude specific SEL entries using the -sx or -xST option, Fan = Warning (0)] | '01-Inlet Ambient'=14.00 '02-CPU 1'=40.00 '04-P1 DIMM 1-3'=24.00 '05-P1 DIMM 4-6'=24.00 '06-P1 DIMM 7-9'=23.00 '07-P1 DIMM 10-12'=24.00 '12-HD Max'=35.00 '13-Chipset'=44.00 '14-P/S 1'=20.00 '15-P/S 2'=18.00 '16-P/S 2 Zone'=20.00 '17-VR P1'=26.00 '18-VR P2'=22.00 '19-VR P1 Mem'=23.00 '20-VR P1 Mem'=26.00 '21-VR P2 Mem'=22.00 '22-VR P2 Mem'=23.00 '23-VR P1Vtt Zone'=25.00 '24-VR P2Vtt Zone'=20.00 '25-HD Controller'=61.00 '26-iLO Zone'=29.00 '34-PCI 1 Zone'=21.00 '35-PCI 2 Zone'=22.00 '36-PCI 3 Zone'=22.00 '40-I/O Board 1'=25.00 '42-VR P1 Zone'=21.00 '43-BIOS Zone'=36.00 '44-System Board'=26.00 '45-SuperCap Max'=16.00 '46-Chipset Zone'=27.00 '47-Battery Zone'=25.00 '48-I/O Zone'=28.00 '49-Sys Exhaust'=26.00 '50-Sys Exhaust'=27.00

01-Inlet Ambient = 14.00 (Status: Nominal)
02-CPU 1 = 40.00 (Status: Nominal)
04-P1 DIMM 1-3 = 24.00 (Status: Nominal)
05-P1 DIMM 4-6 = 24.00 (Status: Nominal)
06-P1 DIMM 7-9 = 23.00 (Status: Nominal)
07-P1 DIMM 10-12 = 24.00 (Status: Nominal)
12-HD Max = 35.00 (Status: Nominal)
13-Chipset = 44.00 (Status: Nominal)
14-P/S 1 = 20.00 (Status: Nominal)
15-P/S 2 = 18.00 (Status: Nominal)
16-P/S 2 Zone = 20.00 (Status: Nominal)
17-VR P1 = 26.00 (Status: Nominal)
18-VR P2 = 22.00 (Status: Nominal)
19-VR P1 Mem = 23.00 (Status: Nominal)
20-VR P1 Mem = 26.00 (Status: Nominal)
21-VR P2 Mem = 22.00 (Status: Nominal)
22-VR P2 Mem = 23.00 (Status: Nominal)
23-VR P1Vtt Zone = 25.00 (Status: Nominal)
24-VR P2Vtt Zone = 20.00 (Status: Nominal)
25-HD Controller = 61.00 (Status: Nominal)
26-iLO Zone = 29.00 (Status: Nominal)
34-PCI 1 Zone = 21.00 (Status: Nominal)
35-PCI 2 Zone = 22.00 (Status: Nominal)
36-PCI 3 Zone = 22.00 (Status: Nominal)
40-I/O Board 1 = 25.00 (Status: Nominal)
42-VR P1 Zone = 21.00 (Status: Nominal)
43-BIOS Zone = 36.00 (Status: Nominal)
44-System Board = 26.00 (Status: Nominal)
45-SuperCap Max = 16.00 (Status: Nominal)
46-Chipset Zone = 27.00 (Status: Nominal)
47-Battery Zone = 25.00 (Status: Nominal)
48-I/O Zone = 28.00 (Status: Nominal)
49-Sys Exhaust = 26.00 (Status: Nominal)
50-Sys Exhaust = 27.00 (Status: Nominal)
Fan 3 = 'transition to Running' (Status: Nominal)
Fan 4 = 'transition to Running' (Status: Nominal)
Fan 5 = 'transition to Running' (Status: Nominal)
Fan 6 = 'transition to Running' (Status: Nominal)
Power Supply 1 = 'Presence detected' (Status: Nominal)
Power Supply 2 = 'Presence detected' (Status: Nominal)
Power Supplies = 'Fully Redundant' (Status: Nominal)
Fans = 'Fully Redundant' (Status: Nominal)
Memory = 'Presence detected' (Status: Nominal)
C1 P1I Bay 1 = 'Drive Presence' (Status: Nominal)
C1 P1I Bay 2 = 'Drive Presence' (Status: Nominal)
C1 P1I Bay 3 = 'Drive Presence' (Status: Nominal)
C1 P1I Bay 4 = 'Drive Presence' (Status: Nominal)
C1 P2I Bay 5 = 'Drive Presence' (Status: Nominal)
C1 P2I Bay 6 = 'Drive Presence' (Status: Nominal)

As you can see the fan numbering starts with "Fan 3" and we have 4 summed up. Please explain to me how I can exclude the first two non-present fans. I assume I have to get the sensor IDs for

[-x <sensor id>]

Thanks a lot for your help.

ronator commented 7 years ago

I did not know you can exclude some fans with -x.

/usr/lib/nagios/plugins/check_ipmi_sensor -H 192.168.x.x -f /etc/freeipmi/icinga.conf -fc 4 -vvv

The verbose output is the following:

------------- debug output for sel (-vvv is set): ------------
  /usr/sbin/ipmi-sel was executed with the following parameters:
    /usr/sbin/ipmi-sel -h 192.168.x.x --config-file /etc/freeipmi/icinga.conf --driver-type=LAN_2_0 --output-event-state --interpret-oem-data --entity-sensor-names --sensor-types=all --exclude-sensor-types=
  output of FreeIPMI:
ID | Date        | Time     | Name                                             | Type         | State    | Event
5  | Jan-26-2015 | 10:31:28 | Sensor #0                                        | OS Critical Stop | Warning  | OS Graceful Shutdown
6  | Jan-26-2015 | 10:31:28 | N/A                                              | N/A              | N/A      | OEM defined = 00h 03h 00h 02h 80h 00h
7  | Jan-26-2015 | 10:34:00 | Sensor #0                                        | OS Boot          | Nominal  | C: boot completed
8  | Jan-26-2015 | 10:34:00 | N/A                                              | N/A              | N/A      | OEM defined = 00h FFh 09h C6h 54h 00h
9  | Jan-26-2015 | 10:34:41 | Sensor #0                                        | OS Critical Stop | Warning  | OS Graceful Shutdown
10 | Jan-26-2015 | 10:34:41 | N/A                                              | N/A              | N/A      | OEM defined = 00h 03h 00h 02h 80h 00h
11 | Jan-26-2015 | 10:36:55 | Sensor #0                                        | OS Boot          | Nominal  | C: boot completed
12 | Jan-26-2015 | 10:36:55 | N/A                                              | N/A              | N/A      | OEM defined = 00h B5h 0Ah C6h 54h 00h
13 | Jan-26-2015 | 10:40:59 | Sensor #0                                        | OS Critical Stop | Warning  | OS Graceful Shutdown
14 | Jan-26-2015 | 10:40:59 | N/A                                              | N/A              | N/A      | OEM defined = 00h FFh 00h 05h 00h 00h
15 | Jan-26-2015 | 10:43:17 | Sensor #0                                        | OS Boot          | Nominal  | C: boot completed
16 | Jan-26-2015 | 10:43:17 | N/A                                              | N/A              | N/A      | OEM defined = 00h 33h 0Ch C6h 54h 00h
17 | Jan-26-2015 | 10:45:48 | Sensor #0                                        | OS Critical Stop | Warning  | OS Graceful Shutdown
18 | Jan-26-2015 | 10:45:48 | N/A                                              | N/A              | N/A      | OEM defined = 00h 02h 00h 04h 84h 00h
19 | Jan-26-2015 | 10:48:09 | Sensor #0                                        | OS Boot          | Nominal  | C: boot completed
20 | Jan-26-2015 | 10:48:09 | N/A                                              | N/A              | N/A      | OEM defined = 00h 55h 0Dh C6h 54h 00h
21 | Jan-26-2015 | 10:48:35 | Sensor #0                                        | OS Critical Stop | Warning  | OS Graceful Shutdown
22 | Jan-26-2015 | 10:48:35 | N/A                                              | N/A              | N/A      | OEM defined = 00h 03h 00h 02h 80h 00h
23 | Jan-26-2015 | 10:50:56 | Sensor #0                                        | OS Boot          | Nominal  | C: boot completed
24 | Jan-26-2015 | 10:50:56 | N/A                                              | N/A              | N/A      | OEM defined = 00h FDh 0Dh C6h 54h 00h
25 | Jan-26-2015 | 11:49:44 | Sensor #0                                        | OS Critical Stop | Warning  | OS Graceful Shutdown
26 | Jan-26-2015 | 11:49:44 | N/A                                              | N/A              | N/A      | OEM defined = 00h 01h 00h 01h 80h 00h
27 | Jan-26-2015 | 11:49:44 | N/A                                              | N/A              | N/A      | OEM defined = 01h 52h 00h 65h 00h 00h
28 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 02h 62h 00h 6Fh 00h 00h
29 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 03h 6Fh 00h 74h 00h 00h
30 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 04h 69h 00h 6Eh 00h 00h
31 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 05h 67h 00h 20h 00h 00h
32 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 06h 61h 00h 74h 00h 00h
33 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 07h 20h 00h 48h 00h 00h
34 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 08h 50h 00h 20h 00h 00h
35 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 09h 53h 00h 55h 00h 00h
36 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 0Ah 4Dh 00h 20h 00h 00h
37 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 0Bh 75h 00h 73h 00h 00h
38 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 0Ch 65h 00h 72h 00h 00h
39 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 0Dh 20h 00h 72h 00h 00h
40 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 0Eh 65h 00h 71h 00h 00h
41 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 0Fh 75h 00h 65h 00h 00h
42 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 10h 73h 00h 74h 00h 00h
43 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 11h 20h 00h 69h 00h 00h
44 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 12h 6Eh 00h 20h 00h 00h
45 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 13h 6Fh 00h 72h 00h 00h
46 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 14h 64h 00h 65h 00h 00h
47 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 15h 72h 00h 20h 00h 00h
48 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 16h 74h 00h 6Fh 00h 00h
49 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 17h 20h 00h 61h 00h 00h
50 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 18h 63h 00h 74h 00h 00h
51 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 19h 69h 00h 76h 00h 00h
52 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 1Ah 61h 00h 74h 00h 00h
53 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 1Bh 65h 00h 20h 00h 00h
54 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 1Ch 64h 00h 65h 00h 00h
55 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 1Dh 70h 00h 6Ch 00h 00h
56 | Jan-26-2015 | 11:49:45 | N/A                                              | N/A              | N/A      | OEM defined = 1Eh 6Fh 00h 79h 00h 00h
57 | Jan-26-2015 | 11:49:46 | N/A                                              | N/A              | N/A      | OEM defined = 1Fh 65h 00h 64h 00h 00h
58 | Jan-26-2015 | 11:49:46 | N/A                                              | N/A              | N/A      | OEM defined = 20h 20h 00h 66h 00h 00h
59 | Jan-26-2015 | 11:49:46 | N/A                                              | N/A              | N/A      | OEM defined = 21h 69h 00h 72h 00h 00h
60 | Jan-26-2015 | 11:49:46 | N/A                                              | N/A              | N/A      | OEM defined = 22h 6Dh 00h 77h 00h 00h
61 | Jan-26-2015 | 11:49:46 | N/A                                              | N/A              | N/A      | OEM defined = 23h 61h 00h 72h 00h 00h
62 | Jan-26-2015 | 11:49:46 | N/A                                              | N/A              | N/A      | OEM defined = 24h 65h 00h 20h 00h 00h
63 | Jan-26-2015 | 11:49:46 | N/A                                              | N/A              | N/A      | OEM defined = 25h 6Fh 00h 72h 00h 00h
64 | Jan-26-2015 | 11:49:46 | N/A                                              | N/A              | N/A      | OEM defined = 26h 20h 00h 73h 00h 00h
65 | Jun-29-2017 | 03:20:21 | Sensor #0                                        | OS Critical Stop | Warning  | OS Graceful Shutdown
66 | Jun-29-2017 | 03:20:21 | N/A                                              | N/A              | N/A      | OEM defined = 00h 02h 00h 02h 80h 00h
67 | Jun-29-2017 | 03:22:53 | Sensor #0                                        | OS Boot          | Nominal  | C: boot completed
68 | Jun-29-2017 | 03:22:53 | N/A                                              | N/A              | N/A      | OEM defined = 00h 71h 64h 54h 59h 00h
------------- debug output for sensors (-vvv is set): ------------
  script was executed with the following parameters:
    /usr/lib/nagios/plugins/check_ipmi_sensor -H 192.168.x.x -f /etc/freeipmi/icinga.conf -fc 4 -vvv
  check_ipmi_sensor version:
    3.12
  FreeIPMI version:
    ipmi-sensors - 1.1.5
  FreeIPMI was executed with the following parameters:
    /usr/sbin/ipmi-sensors -h 192.168.x.x --config-file /etc/freeipmi/icinga.conf --quiet-cache --sdr-cache-recreate --interpret-oem-data --output-sensor-state --ignore-not-available-sensors --driver-type=LAN_2_0
  FreeIPMI return code: 0
  output of FreeIPMI:
ID | Name             | Type         | State    | Reading    | Units | Event
0  | UID Light        | OEM Reserved | N/A      | N/A        | N/A   | 'OEM Event = 0001h'
2  | 01-Inlet Ambient | Temperature  | Nominal  | 14.00      | C     | 'OK'
3  | 02-CPU 1         | Temperature  | Nominal  | 40.00      | C     | 'OK'
5  | 04-P1 DIMM 1-3   | Temperature  | Nominal  | 24.00      | C     | 'OK'
6  | 05-P1 DIMM 4-6   | Temperature  | Nominal  | 24.00      | C     | 'OK'
7  | 06-P1 DIMM 7-9   | Temperature  | Nominal  | 23.00      | C     | 'OK'
8  | 07-P1 DIMM 10-12 | Temperature  | Nominal  | 24.00      | C     | 'OK'
13 | 12-HD Max        | Temperature  | Nominal  | 35.00      | C     | 'OK'
14 | 13-Chipset       | Temperature  | Nominal  | 44.00      | C     | 'OK'
15 | 14-P/S 1         | Temperature  | Nominal  | 20.00      | C     | 'OK'
16 | 15-P/S 2         | Temperature  | Nominal  | 18.00      | C     | 'OK'
17 | 16-P/S 2 Zone    | Temperature  | Nominal  | 20.00      | C     | 'OK'
18 | 17-VR P1         | Temperature  | Nominal  | 26.00      | C     | 'OK'
19 | 18-VR P2         | Temperature  | Nominal  | 22.00      | C     | 'OK'
20 | 19-VR P1 Mem     | Temperature  | Nominal  | 23.00      | C     | 'OK'
21 | 20-VR P1 Mem     | Temperature  | Nominal  | 26.00      | C     | 'OK'
22 | 21-VR P2 Mem     | Temperature  | Nominal  | 22.00      | C     | 'OK'
23 | 22-VR P2 Mem     | Temperature  | Nominal  | 23.00      | C     | 'OK'
24 | 23-VR P1Vtt Zone | Temperature  | Nominal  | 25.00      | C     | 'OK'
25 | 24-VR P2Vtt Zone | Temperature  | Nominal  | 20.00      | C     | 'OK'
26 | 25-HD Controller | Temperature  | Nominal  | 61.00      | C     | 'OK'
27 | 26-iLO Zone      | Temperature  | Nominal  | 29.00      | C     | 'OK'
35 | 34-PCI 1 Zone    | Temperature  | Nominal  | 21.00      | C     | 'OK'
36 | 35-PCI 2 Zone    | Temperature  | Nominal  | 22.00      | C     | 'OK'
37 | 36-PCI 3 Zone    | Temperature  | Nominal  | 22.00      | C     | 'OK'
41 | 40-I/O Board 1   | Temperature  | Nominal  | 25.00      | C     | 'OK'
43 | 42-VR P1 Zone    | Temperature  | Nominal  | 21.00      | C     | 'OK'
44 | 43-BIOS Zone     | Temperature  | Nominal  | 36.00      | C     | 'OK'
45 | 44-System Board  | Temperature  | Nominal  | 26.00      | C     | 'OK'
46 | 45-SuperCap Max  | Temperature  | Nominal  | 16.00      | C     | 'OK'
47 | 46-Chipset Zone  | Temperature  | Nominal  | 27.00      | C     | 'OK'
48 | 47-Battery Zone  | Temperature  | Nominal  | 25.00      | C     | 'OK'
49 | 48-I/O Zone      | Temperature  | Nominal  | 28.00      | C     | 'OK'
50 | 49-Sys Exhaust   | Temperature  | Nominal  | 26.00      | C     | 'OK'
51 | 50-Sys Exhaust   | Temperature  | Nominal  | 27.00      | C     | 'OK'
54 | Fan 3            | Fan          | Nominal  | N/A        | N/A   | 'transition to Running'
55 | Fan 4            | Fan          | Nominal  | N/A        | N/A   | 'transition to Running'
56 | Fan 5            | Fan          | Nominal  | N/A        | N/A   | 'transition to Running'
57 | Fan 6            | Fan          | Nominal  | N/A        | N/A   | 'transition to Running'
58 | Power Supply 1   | Power Supply | Nominal  | N/A        | N/A   | 'Presence detected'
59 | Power Supply 2   | Power Supply | Nominal  | N/A        | N/A   | 'Presence detected'
60 | Power Meter      | Current      | N/A      | N/A        | N/A   | 'Device Enabled'
61 | Power Supplies   | Power Supply | Nominal  | N/A        | N/A   | 'Fully Redundant'
63 | Fans             | Fan          | Nominal  | N/A        | N/A   | 'Fully Redundant'
65 | Memory           | Memory       | Nominal  | N/A        | N/A   | 'Presence detected'
66 | C1 P1I Bay 1     | Drive Slot   | Nominal  | N/A        | N/A   | 'Drive Presence'
67 | C1 P1I Bay 2     | Drive Slot   | Nominal  | N/A        | N/A   | 'Drive Presence'
68 | C1 P1I Bay 3     | Drive Slot   | Nominal  | N/A        | N/A   | 'Drive Presence'
69 | C1 P1I Bay 4     | Drive Slot   | Nominal  | N/A        | N/A   | 'Drive Presence'
70 | C1 P2I Bay 5     | Drive Slot   | Nominal  | N/A        | N/A   | 'Drive Presence'
71 | C1 P2I Bay 6     | Drive Slot   | Nominal  | N/A        | N/A   | 'Drive Presence'

--------------------- end of debug output ---------------------

(long line!)

IPMI Status: Critical [58 system event log (SEL) entries present - details: (Sensor #0 = Warning, OS Critical Stop, OS Graceful Shutdown), (N/A = N/A, N/A, OEM defined = 00h 03h 00h 02h 80h 00h), (N/A = N/A, N/A, OEM defined = 00h FFh 09h C6h 54h 00h), (Sensor #0 = Warning, OS Critical Stop, OS Graceful Shutdown), (N/A = N/A, N/A, OEM defined = 00h 03h 00h 02h 80h 00h), (N/A = N/A, N/A, OEM defined = 00h B5h 0Ah C6h 54h 00h), (Sensor #0 = Warning, OS Critical Stop, OS Graceful Shutdown), (N/A = N/A, N/A, OEM defined = 00h FFh 00h 05h 00h 00h), (N/A = N/A, N/A, OEM defined = 00h 33h 0Ch C6h 54h 00h), (Sensor #0 = Warning, OS Critical Stop, OS Graceful Shutdown), (N/A = N/A, N/A, OEM defined = 00h 02h 00h 04h 84h 00h), (N/A = N/A, N/A, OEM defined = 00h 55h 0Dh C6h 54h 00h), (Sensor #0 = Warning, OS Critical Stop, OS Graceful Shutdown), (N/A = N/A, N/A, OEM defined = 00h 03h 00h 02h 80h 00h), (N/A = N/A, N/A, OEM defined = 00h FDh 0Dh C6h 54h 00h), (Sensor #0 = Warning, OS Critical Stop, OS Graceful Shutdown), (N/A = N/A, N/A, OEM defined = 00h 01h 00h 01h 80h 00h), (N/A = N/A, N/A, OEM defined = 01h 52h 00h 65h 00h 00h), (N/A = N/A, N/A, OEM defined = 02h 62h 00h 6Fh 00h 00h), (N/A = N/A, N/A, OEM defined = 03h 6Fh 00h 74h 00h 00h), (N/A = N/A, N/A, OEM defined = 04h 69h 00h 6Eh 00h 00h), (N/A = N/A, N/A, OEM defined = 05h 67h 00h 20h 00h 00h), (N/A = N/A, N/A, OEM defined = 06h 61h 00h 74h 00h 00h), (N/A = N/A, N/A, OEM defined = 07h 20h 00h 48h 00h 00h), (N/A = N/A, N/A, OEM defined = 08h 50h 00h 20h 00h 00h), (N/A = N/A, N/A, OEM defined = 09h 53h 00h 55h 00h 00h), (N/A = N/A, N/A, OEM defined = 0Ah 4Dh 00h 20h 00h 00h), (N/A = N/A, N/A, OEM defined = 0Bh 75h 00h 73h 00h 00h), (N/A = N/A, N/A, OEM defined = 0Ch 65h 00h 72h 00h 00h), (N/A = N/A, N/A, OEM defined = 0Dh 20h 00h 72h 00h 00h), (N/A = N/A, N/A, OEM defined = 0Eh 65h 00h 71h 00h 00h), (N/A = N/A, N/A, OEM defined = 0Fh 75h 00h 65h 00h 00h), (N/A = N/A, N/A, OEM defined = 10h 73h 00h 74h 00h 00h), (N/A = N/A, N/A, OEM defined = 11h 20h 00h 69h 00h 00h), (N/A = N/A, N/A, OEM defined = 12h 6Eh 00h 20h 00h 00h), (N/A = N/A, N/A, OEM defined = 13h 6Fh 00h 72h 00h 00h), (N/A = N/A, N/A, OEM defined = 14h 64h 00h 65h 00h 00h), (N/A = N/A, N/A, OEM defined = 15h 72h 00h 20h 00h 00h), (N/A = N/A, N/A, OEM defined = 16h 74h 00h 6Fh 00h 00h), (N/A = N/A, N/A, OEM defined = 17h 20h 00h 61h 00h 00h), (N/A = N/A, N/A, OEM defined = 18h 63h 00h 74h 00h 00h), (N/A = N/A, N/A, OEM defined = 19h 69h 00h 76h 00h 00h), (N/A = N/A, N/A, OEM defined = 1Ah 61h 00h 74h 00h 00h), (N/A = N/A, N/A, OEM defined = 1Bh 65h 00h 20h 00h 00h), (N/A = N/A, N/A, OEM defined = 1Ch 64h 00h 65h 00h 00h), (N/A = N/A, N/A, OEM defined = 1Dh 70h 00h 6Ch 00h 00h), (N/A = N/A, N/A, OEM defined = 1Eh 6Fh 00h 79h 00h 00h), (N/A = N/A, N/A, OEM defined = 1Fh 65h 00h 64h 00h 00h), (N/A = N/A, N/A, OEM defined = 20h 20h 00h 66h 00h 00h), (N/A = N/A, N/A, OEM defined = 21h 69h 00h 72h 00h 00h), (N/A = N/A, N/A, OEM defined = 22h 6Dh 00h 77h 00h 00h), (N/A = N/A, N/A, OEM defined = 23h 61h 00h 72h 00h 00h), (N/A = N/A, N/A, OEM defined = 24h 65h 00h 20h 00h 00h), (N/A = N/A, N/A, OEM defined = 25h 6Fh 00h 72h 00h 00h), (N/A = N/A, N/A, OEM defined = 26h 20h 00h 73h 00h 00h), (Sensor #0 = Warning, OS Critical Stop, OS Graceful Shutdown), (N/A = N/A, N/A, OEM defined = 00h 02h 00h 02h 80h 00h), (N/A = N/A, N/A, OEM defined = 00h 71h 64h 54h 59h 00h) - fix the reported issues and clear your SEL or exclude specific SEL entries using the -sx or -xST option, Fan = Warning (0)] | '01-Inlet Ambient'=14.00 '02-CPU 1'=40.00 '04-P1 DIMM 1-3'=24.00 '05-P1 DIMM 4-6'=24.00 '06-P1 DIMM 7-9'=23.00 '07-P1 DIMM 10-12'=24.00 '12-HD Max'=35.00 '13-Chipset'=44.00 '14-P/S 1'=20.00 '15-P/S 2'=18.00 '16-P/S 2 Zone'=20.00 '17-VR P1'=26.00 '18-VR P2'=22.00 '19-VR P1 Mem'=23.00 '20-VR P1 Mem'=26.00 '21-VR P2 Mem'=22.00 '22-VR P2 Mem'=23.00 '23-VR P1Vtt Zone'=25.00 '24-VR P2Vtt Zone'=20.00 '25-HD Controller'=61.00 '26-iLO Zone'=29.00 '34-PCI 1 Zone'=21.00 '35-PCI 2 Zone'=22.00 '36-PCI 3 Zone'=22.00 '40-I/O Board 1'=25.00 '42-VR P1 Zone'=21.00 '43-BIOS Zone'=36.00 '44-System Board'=26.00 '45-SuperCap Max'=16.00 '46-Chipset Zone'=27.00 '47-Battery Zone'=25.00 '48-I/O Zone'=28.00 '49-Sys Exhaust'=26.00 '50-Sys Exhaust'=27.00

01-Inlet Ambient = 14.00 (Status: Nominal)
02-CPU 1 = 40.00 (Status: Nominal)
04-P1 DIMM 1-3 = 24.00 (Status: Nominal)
05-P1 DIMM 4-6 = 24.00 (Status: Nominal)
06-P1 DIMM 7-9 = 23.00 (Status: Nominal)
07-P1 DIMM 10-12 = 24.00 (Status: Nominal)
12-HD Max = 35.00 (Status: Nominal)
13-Chipset = 44.00 (Status: Nominal)
14-P/S 1 = 20.00 (Status: Nominal)
15-P/S 2 = 18.00 (Status: Nominal)
16-P/S 2 Zone = 20.00 (Status: Nominal)
17-VR P1 = 26.00 (Status: Nominal)
18-VR P2 = 22.00 (Status: Nominal)
19-VR P1 Mem = 23.00 (Status: Nominal)
20-VR P1 Mem = 26.00 (Status: Nominal)
21-VR P2 Mem = 22.00 (Status: Nominal)
22-VR P2 Mem = 23.00 (Status: Nominal)
23-VR P1Vtt Zone = 25.00 (Status: Nominal)
24-VR P2Vtt Zone = 20.00 (Status: Nominal)
25-HD Controller = 61.00 (Status: Nominal)
26-iLO Zone = 29.00 (Status: Nominal)
34-PCI 1 Zone = 21.00 (Status: Nominal)
35-PCI 2 Zone = 22.00 (Status: Nominal)
36-PCI 3 Zone = 22.00 (Status: Nominal)
40-I/O Board 1 = 25.00 (Status: Nominal)
42-VR P1 Zone = 21.00 (Status: Nominal)
43-BIOS Zone = 36.00 (Status: Nominal)
44-System Board = 26.00 (Status: Nominal)
45-SuperCap Max = 16.00 (Status: Nominal)
46-Chipset Zone = 27.00 (Status: Nominal)
47-Battery Zone = 25.00 (Status: Nominal)
48-I/O Zone = 28.00 (Status: Nominal)
49-Sys Exhaust = 26.00 (Status: Nominal)
50-Sys Exhaust = 27.00 (Status: Nominal)
Fan 3 = 'transition to Running' (Status: Nominal)
Fan 4 = 'transition to Running' (Status: Nominal)
Fan 5 = 'transition to Running' (Status: Nominal)
Fan 6 = 'transition to Running' (Status: Nominal)
Power Supply 1 = 'Presence detected' (Status: Nominal)
Power Supply 2 = 'Presence detected' (Status: Nominal)
Power Supplies = 'Fully Redundant' (Status: Nominal)
Fans = 'Fully Redundant' (Status: Nominal)
Memory = 'Presence detected' (Status: Nominal)
C1 P1I Bay 1 = 'Drive Presence' (Status: Nominal)
C1 P1I Bay 2 = 'Drive Presence' (Status: Nominal)
C1 P1I Bay 3 = 'Drive Presence' (Status: Nominal)
C1 P1I Bay 4 = 'Drive Presence' (Status: Nominal)
C1 P2I Bay 5 = 'Drive Presence' (Status: Nominal)
C1 P2I Bay 6 = 'Drive Presence' (Status: Nominal)

As you can see the fan numbering starts with "Fan 3" and we have 4 summed up. Please explain to me how I can exclude the first two non-present fans. I assume I have to get the sensor IDs for

[-x <sensor id>]

Thanks a lot for your help.

ronator commented 7 years ago

I forgot to answer, I had this site opened as a tab for like two weeks and just found it - my bad.

tk-wfischer commented 7 years ago

Hi, sorry also from my side for overlooking your answer 14days ago.

From your current debug output, it seems that only the SEL (system event log) entries are causing the critical state ("58 system event log (SEL) entries present").

You have two options:

  1. Do not monitor SEL entries by using the "--nosel" option of the plugin
  2. Delete the SEL entries (e.g. by FreeIPMI's "ipmi-sel --clear" - see https://www.gnu.org/software/freeipmi/manpages/man8/ipmi-sel.8.html or by "ipmitool sel clear" or within the Web Interface of your BMC.

I would recommend option 2. The reason is that there could be possible errors with many servers do not monitor with the normal sensors reported by "ipmi-sensors", but which can cause new entries in the system event log. An example for this kind of errors are e.g. Memory warnings like "DIMM3 correctable ECC error detected" or the like.

gschoenberger commented 6 years ago

Closing this due to inactivity