sonic-net / sonic-platform-daemons

Platform module daemons for SONiC
Other
25 stars 159 forks source link

Implementation of a Monitoring Daemon for storage devices in SONiC switches #433

Closed assrinivasan closed 6 months ago

assrinivasan commented 9 months ago

Description

This commit adds a monitoring daemon for Storage device attributes on a device running SONiC. SONiC Storage Monitoring Daemon HLD

Motivation and Context

Storage devices experience performance degradation over time on account of a variety of factors such as overall disk writes, bad-blocks management, lack of free space, sub-optimal operational temperature and good-old wear-and-tear which speaks to the overall health of the disk.

The goal of the Storage Monitoring Daemon (storagemond) is to provide meaningful metrics for the aforementioned issues and enable streaming telemetry for these attributes so that the required preventative measures are triggered in the eventuality of performance degradation.

How Has This Been Tested?

Has been manually tested on following platforms:

7050cx3.txt S6100.txt SN2700.txt

Additional Information (Optional)

prgeor commented 6 months ago

@assrinivasan please add more details for manual testing.

sonc image upgrade, reboot, crash, fast/warm reboot

assrinivasan commented 6 months ago

@assrinivasan please add more details for manual testing.

sonc image upgrade, reboot, crash, fast/warm reboot

Added to the PR.

linux-foundation-easycla[bot] commented 6 months ago

CLA Signed

The committers listed above are authorized under a signed CLA.

assrinivasan commented 6 months ago

/azpw run

mssonicbld commented 6 months ago

/AzurePipelines run

azure-pipelines[bot] commented 6 months ago
Azure Pipelines successfully started running 1 pipeline(s).
assrinivasan commented 6 months ago

/azpw run

mssonicbld commented 6 months ago

/AzurePipelines run

azure-pipelines[bot] commented 6 months ago
Azure Pipelines successfully started running 1 pipeline(s).