WebThingsIO / gateway

WebThings Gateway - a self-hosted web application for monitoring and controlling a building over the web
http://webthings.io/gateway
Mozilla Public License 2.0
2.62k stars 339 forks source link

Ubuntu Core Kernel Panics on Intel NUC: BUG: unable to handle page fault for address #3174

Open benfrancis opened 1 month ago

benfrancis commented 1 month ago

Note: I don't think this is actually a WebThings Gateway bug, it's an issue with Ubuntu Core. I'm filing it here for now because I don't yet have an upstream issue to track.


I'm testing WebThings Gateway on an Intel NUC 11. Specifically the NUC 11 CMCR1ABA i3 (Austin Beach rugged chassis + Elk Bay compute module). I don't think this particular version of the Intel NUC is officially certified by Canonical, but it does run Ubuntu Core (ubuntu-core-22-amd64.img).

However, I've recently noticed that when I leave it running for long periods of time (e.g. overnight) Ubuntu Core freezes with what I think is a memory-related kernel panic.

I've attached screenshots (sorry for the photos but because the machine freezes it's difficult for me to capture the text) of two separate occasions when this happened, with slightly different errors:

BUG: unable to handle page fault for address: ffffffffc121f1c0
#PF: supervisor read access in kernel mode
#PF: error_code(0x0009) - reserved bit violation

PXL_20241011_091155155(3)

BUG: unable to handle page fault for address: fffffffffffffff8
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page

PXL_20241010_154119004(1)

My first thought was that it could be a hardware problem, but at least on a first pass memtest hasn't found any faults, see attached screenshot. (Update: No failures after 12 passes).

PXL_20241011_115433093 MP(1)

I've seen quite a few open bugs of this nature on Launchpad with no resolution so I've contacted Canonical for support.