librenms / librenms-agent

LibreNMS Agent & Scripts
GNU General Public License v2.0

ZFS plugin returns SSD cache errors. #475

Open Fever-Wits opened 1 year ago

Fever-Wits commented 1 year ago

Hello, I use the zfs plugin to monitor a ZFS pool, and I have added an SSD as a cache device. I occasionally get the following error in LibreNMS: ZFS L2 cache has experienced errors

As far as I understand, the plugin reads its data from /proc/spl/kstat/zfs/arcstats, takes the l2_writes_error, l2_cksum_bad, and l2_io_error counters, and reports them.
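To check that understanding against the raw counters, here is a minimal sketch that reads the same kstat file and prints the three counters next to their sum. It assumes the reported l2_errors value is the plain sum of those three fields, which is a guess and is not taken from the plugin source; the function names `parse_arcstats` and `l2_error_breakdown` are hypothetical. Note that `grep err` cannot match `l2_cksum_bad`, since that name does not contain "err", so printing each counter by name shows which one is actually non-zero.

```python
from pathlib import Path

# Counters assumed to make up the plugin's l2_errors value.
# This list reflects the understanding described above, not the plugin source.
L2_ERROR_FIELDS = ("l2_writes_error", "l2_cksum_bad", "l2_io_error")


def parse_arcstats(text: str) -> dict[str, int]:
    """Parse kstat text ('name type data' per line) into {name: value}."""
    stats = {}
    for line in text.splitlines():
        parts = line.split()
        # Data rows have exactly three columns with an integer in the last one;
        # the two header lines of the kstat file do not match this shape.
        if len(parts) == 3 and parts[2].isdigit():
            stats[parts[0]] = int(parts[2])
    return stats


def l2_error_breakdown(stats: dict[str, int]) -> dict[str, int]:
    """Return each assumed L2 error counter, defaulting to 0 if absent."""
    return {name: stats.get(name, 0) for name in L2_ERROR_FIELDS}


if __name__ == "__main__":
    path = Path("/proc/spl/kstat/zfs/arcstats")
    if path.exists():
        breakdown = l2_error_breakdown(parse_arcstats(path.read_text()))
        for name, value in breakdown.items():
            print(f"{name}: {value}")
        print(f"sum: {sum(breakdown.values())}")
    else:
        print(f"{path} not found (is the ZFS kernel module loaded?)")
```

If the printed sum matches the l2_errors value from the plugin, the non-zero counter is the one to investigate; if it does not, the assumption about how l2_errors is computed is wrong.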

I am trying to understand what these errors are and what causes them. The server is a Dell R720 with an HBA connected to the front backplane. OS: CentOS Stream 9. zpool status shows no errors for the SSD, and I have looked through the logs without finding any errors either. Can anyone point me to what I should be looking for, or is the script simply reporting a wrong value?

# cat /proc/spl/kstat/zfs/arcstats | grep err
l2_writes_error 4 0
l2_io_error 4 0
l2_rebuild_io_errors 4 0
l2_rebuild_dh_errors 4 0
l2_rebuild_cksum_lb_errors 4 0
# ./scripts/zfs | jq | grep err
   "errorString": "",
   "error": 0,
     "l2_rebuild_cksum_lb_errors": "0",
     "l2_writes_error": 0,
     "l2_errors": 182,
     "l2_rebuild_io_errors": "0",
     "l2_io_error": 0,
     "l2_rebuild_dh_errors": "0",
# zpool status
   pool: gluster
  state: ONLINE
configuration:

         NAME STATE READ WRITE CKSUM
         gluster ONLINE 0 0 0
           mirror-0 ONLINE 0 0 0
             wwn-0x50014ee2bf64a904 ONLINE 0 0 0
             wwn-0x50014ee26a75fb01 ONLINE 0 0 0
           mirror-1 ONLINE 0 0 0
             scsi-350014ee214b999f7 ONLINE 0 0 0
             wwn-0x50014ee2bf649b07 ONLINE 0 0 0
           mirror-2 ONLINE 0 0 0
             wwn-0x50014ee2bf64ae58 ONLINE 0 0 0
             wwn-0x50014ee214b97135 ONLINE 0 0 0
           mirror-3 ONLINE 0 0 0
             wwn-0x50014ee214b98557 ONLINE 0 0 0
             wwn-0x50014ee26a0eb0da ONLINE 0 0 0
         cache
           scsi-SPliant_LB806M_41061188 ONLINE 0 0 0
         spares
           scsi-SATA_WDC_WD20EFRX-68E_WD-WCC4M5PKF7P3 AVAIL

errors: No known data errors

Regards,