hetrixtools / agent

HetrixTools Server Monitoring Agent (Linux)
https://hetrixtools.com/uptime-monitor/
88 stars 29 forks source link

Option to skip disk check for a certain node (also, no swap causes division by zero once again) #55

Closed zejjnt closed 1 year ago

zejjnt commented 1 year ago

One of my USB-hubs has a built-in memory card reader which makes the agent hog huge system resources, never report any data and not prompt any error messages if not manually checking hetrixtools_cron.log. I don't know how I can assist in the troubleshooting, but here is that log (the device in question is /dev/sdc).

hetrixtools_cron.log

zejjnt commented 1 year ago

Seems like something additional is wrong here; the awk problem remains even after disabling disk health checking. Is there any way to get any useful info at all except for dissecting the script statement by statement? hetrixtools_cron2.log

zejjnt commented 1 year ago

Couldn't resist doing a bit of digging. Once again (#38) you assume that everyone has a swap file, just like last time. That's the division by zero. The disk problem is probably a separate issue though but please, not everyone uses swap.

hetrixtools commented 1 year ago

Hello,

Thank you for reporting these issues.

The division by zero error has now been fixed in version 2.0.10 https://github.com/hetrixtools/agent/commit/c76788e901887378c8a10202b4be00b22442787e

For the disk related issue, please open a support ticket on our platform, as we'll need more info in order to try and reproduce the issue on our testing environments.