Closed hakasapl closed 2 months ago
Sampling rate and accuracy requirements are going to vary depending on the use case. I think Jonathan Appavoo and Han Dong will have the most strict requirements, so let's collect those. We may not want to collect at that level for all machines. Ideally, it would be good to be able to have sampling interval be configurable.
As a ESI hardware administrator it would be great to get alerts about:
We can get all of this and more from iDrac, all out-of-band.
Closing this for now, I think we have a good plan
In parallel with NERC, we want to monitor and alert for metrics for all ESI/OCT nodes from PDUs as well as IPMI data.
The purpose of this issue is to outline requirements and design a plan.
Requirements:
Unknown Requirements:
Proposed Solution:
Initial slack discussion: