We're running the NHC 1.4.3 RC1 RPM lbnl-nhc-1.4.3-1.el8.noarch on ~100 AlmaLinux 8.5 systems.
These servers have Cornelis (Intel) Omni-Path 100 Gbit adapters, and I check them with this rule in nhc.conf:
d*.nifl.fysik.dtu.dk || check_hw_ib 100
Due to some hardware testing I removed the adapter, and now NHC rightly gives an error message, albeit a strange one:
ERROR: nhc: Health check failed: check_hw_ib: Version mismatch between kernel OFED drivers and userspace OFED libraries.
I wonder if a more informative error message could be issued, such as "missing network interface" or similar?
Thanks,
Ole
We're running the NHC 1.4.3 RC1 RPM lbnl-nhc-1.4.3-1.el8.noarch on ~100 AlmaLinux 8.5 systems. These servers have Cornelis (Intel) Omni-Path 100 Gbit adapters, and I check them with this rule in nhc.conf:
Due to some hardware testing I removed the adapter, and now NHC rightly gives an error message, albeit a strange one:
I wonder if a more informative error message could be issued, such as "missing network interface" or similar? Thanks, Ole