ZenithControlLabs / Zenith_Lib

0 stars 1 forks source link

Total system lockups on Linux #5

Open panther03 opened 11 months ago

panther03 commented 11 months ago

I'm reporting on this because I see it as a total possibility that Phobri is the culprit and I don't want it to go undocumented, but it also could be totally my system's fault.

Randomly, on my dev machine running Ubuntu 22.04, my system will totally lock up for a few seconds, resume for just a bit, and then 100% freeze up. During the initial lock up period, sound still plays, but afterwards it's just fucked.

The catch is: I've only noticed this with Phobri connected (albeit sample size = 2, but I don't have it connected terribly often).

Kernel logs also seem to suggest a USB problem:

Sep 24 11:39:31 triangulum kernel: [52811.393011] xhci_hcd 0000:05:00.4: Timeout while waiting for stop endpoint com
mand
Sep 24 11:39:33 triangulum kernel: [52813.728826] [drm:drm_crtc_commit_wait [drm]] *ERROR* flip_done timed out
Sep 24 11:39:33 triangulum kernel: [52813.728867] [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ER
ROR* [CRTC:67:crtc-0] commit wait timed out
Sep 24 11:39:33 triangulum kernel: [52813.728939] [drm:drm_atomic_helper_wait_for_flip_done [drm_kms_helper]] *ERROR
* [CRTC:67:crtc-0] flip_done timed out
Sep 24 11:39:34 triangulum kernel: [52814.241059] iwlwifi 0000:04:00.0: Queue 10 is active on fifo 1 and stuck for 1
0000 ms. SW [236, 23] HW [23, 23] FH TRB=0x0c010a016
Sep 24 11:39:34 triangulum kernel: [52814.272837] [drm] Fence fallback timer expired on ring gfx
Sep 24 11:39:42 triangulum kernel: [52821.920826] xhci_hcd 0000:05:00.4: xHCI host not responding to stop endpoint command.
Sep 24 11:39:42 triangulum kernel: [52821.920832] xhci_hcd 0000:05:00.4: USBSTS: 0x00000018 EINT PCD
Sep 24 11:39:42 triangulum kernel: [52821.924815] xhci_hcd 0000:05:00.4: xHCI host controller not responding, assume dead
Sep 24 11:39:42 triangulum kernel: [52821.924815] xhci_hcd 0000:05:00.4: HC died; cleaning up
Sep 24 11:39:42 triangulum kernel: [52821.941098] usb 3-1: USB disconnect, device number 2
Sep 24 11:39:42 triangulum kernel: [52821.949109] usb 3-2: USB disconnect, device number 3
Sep 24 11:39:42 triangulum kernel: [52822.560845] [drm] Fence fallback timer expired on ring gfx
Sep 24 11:39:44 triangulum kernel: [52823.968926] [drm:drm_crtc_commit_wait [drm]] *ERROR* flip_done timed out
Sep 24 11:39:44 triangulum kernel: [52823.968966] [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [PLANE:55:plane-3] commit wait timed out
Sep 24 11:39:44 triangulum kernel: [52824.340554] watchdog: BUG: soft lockup - CPU#4 stuck for 22s! [colord-sane:53926]

I will note I am having other problems with USB. Certain mouses randomly don't work or behave very strangely. The USB controller in my system could be fucked, or perhaps the kernel driver is too.

panther03 commented 11 months ago

Noticed this happening with the N64 Remapper as well. There's a good chance it's something in the firmware, but the low-level USB code hasn't been changed in a long time, before I never noticed this happening - so I am also entertaining the fact that my laptop is just screwed.