Open boomam opened 12 months ago
hi @boomam
when running smartctl within the container manually (for testing) you should use smartctl --xall --json /dev/sdg
not smartctl --xall --json /dev/sdg" type=metrics
hi @boomam when running smartctl within the container manually (for testing) you should use
smartctl --xall --json /dev/sdg
notsmartctl --xall --json /dev/sdg" type=metrics
Was just running what was in the log output ;-)
Running the new command generates an extensive output of smart stats - do you want them for diagnosis?
just so I understand the issue a bit better, can you clarify a couple of things?
smartctl
are usually handled correctly (ignored) as long as the JSON payload is correct (and SMART status is "passed"). Are you seeing a failed SMART attribute in the UI?
what is the error in the UI exactly? Can you include screenshots? Attached.
![]()
Are you seeing a failed SMART attribute in the UI? this could be due to Backblaze analysis.
Yes, the attributes mostly all show.
Capacity doesn't show however, but im not too concerned with that right now.
Re: Backblaze -
Possibly, I know the MX500's v1 got a bad rap for failures based on one of the attributes incorrectly reporting, however a few firmware updates and a new revision later and its all good.
If Backblaze is used as the source of truth, then that could explain flagging it as failed i guess?
Although if that is the case, I would probably suggest we look at changing the status from 'failed' to something that shows its not failed, but is likely to based on historical nature.
Describe the bug Understanding the guidence here, the Crucial MX500 array I am monitoring reads as failing SMART, when a manual run of smartctl on the host being monitored via the collector, shows the drive as passed/working.
There is suggestion of 'exit code 4', from the collector, but the drives otherwise report stats. Running the command referenced on the collector manually,
smartctl --xall --json /dev/sdg" type=metrics
does not actually work, instead hanging on a>
prompt.Net result is incorrect status in GUI, and thus incorrect reporting via notification methods.
Expected behavior Correct reporting of SMART state/attributes.
Screenshots Not a screenshot, but localized output of smartctl -
Log Files From collector.
in another terminal trigger the collector
docker exec scrutiny scrutiny-collector-metrics run
The log files will be available on your host in the
config
directory. Please attach them to this issue. no logs existPlease also provide the output of
docker info
n/a