quantum / esos

An open source, high performance, block-level storage platform.
http://www.esos-project.com/
Other
279 stars 57 forks source link

Hardware health broken? #248

Open direktornswe opened 4 years ago

direktornswe commented 4 years ago

I followed the new 2.x style installation where hardware raid controller tools are installed after setup. I'm on a LSI/Brocade/MegaRaid controller and downloaded StorCLI without issues and its working fine in console.

So I wanted to move ahead and make sure that healthchecks works, and to my surplice it seems it's not. health_chk.sh refers to MegaCli64, that is not installed. /usr/local/sbin/hw_raid_cli.py seems to included but I couldn't fint out what it does, when it comes to health checks. Running the command just skips MegaRAID.

[root@san01 ~]# /usr/local/sbin/health_chk.sh 
Checking hardware RAID logical drives...

Checking hardware RAID physical drives...

It appears the '/opt/sbin/MegaCli64' tool is not installed, or at least
is not executable. Skipping additional MegaRAID health checks...

I just check another storage array, running 1.3.5, and it showed the exact same issue. Running 2.0 on the first SAN node.

1 Is there a way to get hardware monitoring working? 2 would it be possible to get SNMP working? It's mainly to monitor HD temperature and that's currently not possible.,

msmith626 commented 4 years ago

Can you provide a directory list of "/opt/sbin/" on your system? Does the "MegaCli64" tool indeed not exist in that directory? Can you provide output/steps of how you installed the tool?