AnalogJ / scrutiny

Hard Drive S.M.A.R.T Monitoring, Historical Trends & Real World Failure Thresholds
MIT License
5.36k stars 171 forks source link

[BUG] TrueNas Scale - All NVMe drives show as failed #721

Open panzerscope opened 1 day ago

panzerscope commented 1 day ago

Describe the bug The app shows all NVMe drives as FAILED. This is highly unlikely to be the case as at least two of the drives are only 145 days old from the discovery of this bug.

Expected behavior For the NVMe drives to have PASSED, or at minimum for some of them to have passed.

Information

I am running TrueNas Scale ElectricEel-24.10.0.2 and Scrutiny version 0.8.1. During the Scrutiny app installation I initially did NOT have "Warning: Elevate Privileges" enabled which meant even though the NVMe drives were seen, value and readings were missing as seen below.

image

At this point I edited the settings and enabled "Warning: Elevate Privileges" after which additional values were now being detected, however the drives still showed as failed. as per the below screenshot.

image

If logs files are required to assist with diagnostics, please can you instruct me on how to get the appropriate logs using TrueNas.

Many thanks!

dspalu32 commented 1 day ago

I saw precisely the same set of circumstances on TrueNAS: needing to re-run with elevated privileges to get NVMe drives to read correctly, but then seeing the drive marked as failed despite now having 'good' data.

I simply nuked the the config and db and restarted. All is well. I'm guessing the failure status may have self-cleared after some refresh cycle, but I had no history to loose.

(rm -r /mnt/.ix-apps/app_mounts/scrutiny/config/ /mnt/.ix-apps/app_mounts/scrutiny/influxdb/)

panzerscope commented 6 hours ago

I saw precisely the same set of circumstances on TrueNAS: needing to re-run with elevated privileges to get NVMe drives to read correctly, but then seeing the drive marked as failed despite now having 'good' data.

I simply nuked the the config and db and restarted. All is well. I'm guessing the failure status may have self-cleared after some refresh cycle, but I had no history to loose.

(rm -r /mnt/.ix-apps/app_mounts/scrutiny/config/ /mnt/.ix-apps/app_mounts/scrutiny/influxdb/)

Thanks, Just deleting the config file seems to have fixed the issue. I will create another bug report however as for some reason my images are missing from the app in the noted locations lol

image