Closed abelbeck closed 2 years ago
I haven't seen this happen anywhere previously either. Based on the output, I suspect this is some sort of issue in the kernel but I have no idea how to do kernel debugging in this sort of situation. Few questions that may help understand this issue further:
vnstat
return or was there some error from vnstatd
?vnstat -D -tr
says as that would try to just access the interface statistics in the same way the daemon does, if that works then starting the daemon manually with vnstatd -D -n
may provide further cluesMy only guess it that there could by some issue with the virtio_net
device kernel module but in that case I would have expected this to be a more common issue. Not sure if the hypervisor Linode provides could cause this sort of fault.
Hi @vergoh
how reproducible is this?
Not reproducible, so far it has happened only 2 times, at these times:
Apr 18 23:00:00
- Linux 4.19.236 #1 SMP PREEMPT x86_64 GNU/Linux
- vnStat 2.9
- sqlite3 3.38.2
Jul 27 06:00:00
- Linux 4.19.248 #1 SMP PREEMPT x86_64 GNU/Linux
- vnStat 2.9
- sqlite3 3.39.2
did this start happening after a kernel upgrade?
A minor kernel bump occurred from 4.19.230 to 4.19.236 on Mar 26 2022. No immediate correlation.
what error did
vnstat
return or was there some error fromvnstatd
?
Sorry I did not note the error, something like "could not open database" or such.
if this happens again, see what
vnstat -D -tr
says as that would try to just access the interface statistics in the same way the daemon does, if that works then starting the daemon manually withvnstatd -D -n
may provide further clues
OK, thanks, I will try that next time.
My only guess it that there could by some issue with the
virtio_net
device kernel module but in that case I would have expected this to be a more common issue. Not sure if the hypervisor Linode provides could cause this sort of fault.
FYI, here are the recent virtio_net
4.19.y kernel changes:
2022-03-02 4.19.232 https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git/commit/?h=linux-4.19.y&id=e9ffbe63f6f32f526a461756309b61c395168d73 2022-07-02 4.19.250 https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git/commit/?h=linux-4.19.y&id=9222672fa6370f0ec3d899662cb8680e9282fc4c 2022-07-07 4.19.251 https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git/commit/?h=linux-4.19.y&id=eb12a6398ee3bd33f4400bed756e72c23cc3d7f7
I am now running Linux 4.19.254
. I will update if/when I get more data.
BTW, I also test on a single core Vultr instance (KVM similar to Linode, using virtio_net
) and have never seen this issue.
Testing with:
- Linux 4.19.254 #1 SMP PREEMPT x86_64 GNU/Linux
- vnStat 2.9
- sqlite3 3.39.3
This issue has not occurred again for the last two months.
Closing, presume a fix to the Linux kernel solved the problem.
vnstat
is a standard part of our AstLinux Project, cross-compiled from source.vnstat
is a real gem, works perfectly ... except (twice now that I noticed) on a single core Linode (Custom ISO) instancevnstatd
causes a kernel "NULL pointer dereference"The only interface
vnstat
monitors is onevirtio_net
device,vnstat
only returned errors afterJul 27 06:00:00
(below).A system reboot was required to fix
vnstat
.I have not ever seen this issue anywhere else, be it a VM or bare-metal box.
Basic debugging below: