Seagate / cortx-monitor

CORTX Monitor tracks platform health and raises alerts on sensing any unintended state. It can detect hardware faults/ removal/ replacement by continuously sensing sub-systems like Storage Enclosure, Node Servers and components, network interfaces.
https://github.com/Seagate/cortx
GNU Affero General Public License v3.0
4 stars 38 forks source link

EOS-19902: Monitor server HBA card & raise health alerts(Sensor) #533

Closed mariyappanp closed 2 years ago

mariyappanp commented 2 years ago

Problem Statement:

SSPL doesnt monitor HBA cards as of today. We will understand the ways & possibilities for same through spike tkt EOS-19901 and then implement monitoring for same.

Solution Design:

https://seagate-systems.atlassian.net/wiki/spaces/sspl/pages/825262182/HBA+Card+Monitoring

Coding:

Testing:

Integration:

PR checklist:

stale[bot] commented 2 years ago

This issue/pull request has been marked as needs attention as it has been left pending without new activity for 4 days. Tagging @indrajitzagade @thavanathan for appropriate assignment. Sorry for the delay & Thank you for contributing to CORTX. We will get back to you as soon as possible.

stale[bot] commented 2 years ago

This issue/pull request has been marked as needs attention as it has been left pending without new activity for 4 days. Tagging @indrajitzagade @thavanathan for appropriate assignment. Sorry for the delay & Thank you for contributing to CORTX. We will get back to you as soon as possible.