thesofproject / linux

Linux kernel source tree
Other
91 stars 133 forks source link

[LNL] rcu_preempt self-detected stall on CPU #5008

Closed marc-hb closed 4 months ago

marc-hb commented 6 months ago

Spotted in daily test run 41267?model=LNLM_RVP_NOCODEC&testcase=check-suspend-resume-with-capture-5

Is the suspend/resume test pushing something "too hard"? https://www.kernel.org/doc/html/latest/RCU/stallwarn.html

Funny enough, the number of seconds in the error message climbs up but the date stays at May 20 15:52:21!

root@ba-lnlm-rvp-nocodec-01:~# journalctl -b -2 -g stuck
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 45s! [rtcwake:72543]
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 71s! [rtcwake:72543]
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 104s! [rtcwake:72543]
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 130s! [rtcwake:72543]
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 160s! [rtcwake:72543]
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 186s! [rtcwake:72543]
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 220s! [rtcwake:72543]
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 246s! [rtcwake:72543]

Start Time: 2024-05-20 13:02:40 UTC Linux Branch: topic/sof-dev Linux Commit: d99d9a0ab917 KConfig Branch: master KConfig Commit: 8fee06f8fd8a

SOF Branch: main SOF Commit: 69249fb75b86 Zephyr Commit: e97d33d0c896

May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: rcu: INFO: rcu_preempt self-detected stall on CPU
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: rcu:         1-....: (20981 ticks this GP) idle=2714/1/0x4000000000000000 softirq=1101696/1101696 fqs=1989
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: rcu:         (t=21000 jiffies g=1105033 q=94 ncpus=8)
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: CPU: 1 PID: 72543 Comm: rtcwake Not tainted 6.9.0-rc5-gd99d9a0ab917 #dev
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: Hardware name: Intel Corporation Lunar Lake Client Platform/LNL-M LP5 RVP1, BIOS LNLMFWI1.R00.2470.D84.2312061937 12>
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: RIP: 0010:smp_call_function_single+0xfa/0x140
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: Code: 25 28 00 00 00 75 55 c9 c3 cc cc cc cc 48 89 e6 48 89 54 24 18 4c 89 44 24 10 e8 61 fe ff ff 8b 54 24 08 83 e2>
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: RSP: 0018:ffffbc84868abc80 EFLAGS: 00000202
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: RAX: 0000000000000000 RBX: ffffa3824b54b400 RCX: ffffa3824ee55978
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: RDX: 0000000000000001 RSI: ffffbc84868abc80 RDI: ffffbc84868abc80
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: RBP: ffffbc84868abcc8 R08: ffffffff922c5250 R09: 0000000000000001
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: R10: ffffbc84868abd20 R11: 0000000000001281 R12: 00000882605b0a4b
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: R13: ffffa3824ee22000 R14: ffffbc84868abd90 R15: 0000000000000000
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: FS:  00007f9c1a9d6740(0000) GS:ffffa389a0a00000(0000) knlGS:0000000000000000
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: CR2: 00007f4f665c63c0 CR3: 000000010939c002 CR4: 0000000000f70ef0
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: DR3: 0000000000000000 DR6: 00000000ffff07f0 DR7: 0000000000000400
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: PKRU: 55555554
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: Call Trace:
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  <IRQ>
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  ? rcu_dump_cpu_stacks+0xf9/0x160
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  ? rcu_sched_clock_irq+0x5c4/0xef0
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  ? find_held_lock+0x32/0x90
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  ? local_clock_noinstr+0xd/0xc0
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  ? local_clock+0x15/0x30
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  ? lock_release+0x26a/0x3e0
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  ? get_jiffies_update+0x44/0x90
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  ? update_process_times+0x6d/0xb0
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  ? tick_nohz_handler+0xc9/0x120
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  ? __pfx_tick_nohz_handler+0x10/0x10
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  ? __hrtimer_run_queues+0xf8/0x330
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  ? hrtimer_interrupt+0x103/0x240
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  ? __sysvec_apic_timer_interrupt+0x6c/0x1f0
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  ? sysvec_apic_timer_interrupt+0x6b/0x80
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  </IRQ>
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  <TASK>
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  ? asm_sysvec_apic_timer_interrupt+0x1a/0x20
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  ? __pfx___wrmsr_on_cpu+0x10/0x10
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  ? smp_call_function_single+0xfa/0x140
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  ? __pfx___wrmsr_on_cpu+0x10/0x10
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  wrmsrl_on_cpu+0x53/0x80
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  intel_pstate_hwp_enable+0x1a2/0x1c0
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  intel_pstate_resume+0x92/0xd0
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  cpufreq_resume+0x75/0x150
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  dpm_resume+0x125/0x190
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  dpm_resume_end+0x11/0x20
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  suspend_devices_and_enter+0x1df/0x700
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  pm_suspend+0x1ae/0x360
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  state_store+0x75/0xe0
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  kernfs_fop_write_iter+0x12d/0x1d0
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  vfs_write+0x362/0x480
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  ksys_write+0x69/0xf0
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  do_syscall_64+0xa8/0x1b0
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  entry_SYSCALL_64_after_hwframe+0x77/0x7f
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: RIP: 0033:0x7f9c1aaed887
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: Code: 10 00 f7 d8 64 89 02 48 c7 c0 ff ff ff ff eb b7 0f 1f 00 f3 0f 1e fa 64 8b 04 25 18 00 00 00 85 c0 75 10 b8 01>
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: RSP: 002b:00007ffff56babc8 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: RAX: ffffffffffffffda RBX: 0000000000000004 RCX: 00007f9c1aaed887
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: RDX: 0000000000000004 RSI: 000055891f75f570 RDI: 0000000000000004
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: RBP: 000055891f75f570 R08: 0000000000000000 R09: 000055891f75f570
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: R10: 00007f9c1abf3fc0 R11: 0000000000000246 R12: 0000000000000004
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: R13: 000055891f75c510 R14: 00007f9c1abf0600 R15: 00007f9c1abefa00
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  </TASK>
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 45s! [rtcwake:72543]
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: Modules linked in: snd_usb_audio snd_usbmidi_lib snd_hwdep snd_sof_probes snd_sof_ipc_msg_injector snd_sof_nocodec s>
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  processor_thermal_wt_req soundcore processor_thermal_power_floor ttm drm_display_helper processor_thermal_mbox drm_>
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: CPU: 1 PID: 72543 Comm: rtcwake Not tainted 6.9.0-rc5-gd99d9a0ab917 #dev
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: Hardware name: Intel Corporation Lunar Lake Client Platform/LNL-M LP5 RVP1, BIOS LNLMFWI1.R00.2470.D84.2312061937 12>
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: RIP: 0010:smp_call_function_single+0xfa/0x140
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: Code: 25 28 00 00 00 75 55 c9 c3 cc cc cc cc 48 89 e6 48 89 54 24 18 4c 89 44 24 10 e8 61 fe ff ff 8b 54 24 08 83 e2>
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: RSP: 0018:ffffbc84868abc80 EFLAGS: 00000202
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: RAX: 0000000000000000 RBX: ffffa3824b54b400 RCX: ffffa3824ee55978
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: RDX: 0000000000000001 RSI: ffffbc84868abc80 RDI: ffffbc84868abc80
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: RBP: ffffbc84868abcc8 R08: ffffffff922c5250 R09: 0000000000000001
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: R10: ffffbc84868abd20 R11: 0000000000001281 R12: 00000882605b0a4b
.
.
.
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 71s! [rtcwake:72543]
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: Modules linked in: snd_usb_audio snd_usbmidi_lib snd_hwdep snd_sof_probes snd_sof_ipc_msg_injector snd_sof_nocodec s>
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel:  processor_thermal_wt_req soundcore processor_thermal_power_floor ttm drm_display_helper processor_thermal_mbox drm_>
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: CPU: 1 PID: 72543 Comm: rtcwake Tainted: G             L     6.9.0-rc5-gd99d9a0ab917 #dev
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: Hardware name: Intel Corporation Lunar Lake Client Platform/LNL-M LP5 RVP1, BIOS LNLMFWI1.R00.2470.D84.2312061937 12>
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: RIP: 0010:smp_call_function_single+0xfa/0x140
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: Code: 25 28 00 00 00 75 55 c9 c3 cc cc cc cc 48 89 e6 48 89 54 24 18 4c 89 44 24 10 e8 61 fe ff ff 8b 54 24 08 83 e2>
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: RSP: 0018:ffffbc84868abc80 EFLAGS: 00000202
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: RAX: 0000000000000000 RBX: ffffa3824b54b400 RCX: ffffa3824ee55978
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: RDX: 0000000000000001 RSI: ffffbc84868abc80 RDI: ffffbc84868abc80
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: RBP: ffffbc84868abcc8 R08: ffffffff922c5250 R09: 0000000000000001
.
.
.
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 130s! [rtcwake:72543]
marc-hb commented 6 months ago

Note there are also ACPI errors in the logs but we've always ignored them

journalctl -b -2 -p 3

May 20 15:46:29 ba-lnlm-rvp-nocodec-01 kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PC00.LPCB.H_EC.ECNT.PPOE], AE_NOT_FOUND (20230628/psargs-330)
May 20 15:46:29 ba-lnlm-rvp-nocodec-01 kernel: ACPI Error: Aborting method \_SB.PC00.LPCB.H_EC.ECNT due to previous error (AE_NOT_FOUND) (20230628/psparse-529)
May 20 15:46:29 ba-lnlm-rvp-nocodec-01 kernel: ACPI Error: Aborting method \_SB.PC00.LPCB.NTIR due to previous error (AE_NOT_FOUND) (20230628/psparse-529)
May 20 15:46:29 ba-lnlm-rvp-nocodec-01 kernel: ACPI Error: Aborting method \_SB.PEPD._DSM due to previous error (AE_NOT_FOUND) (20230628/psparse-529)
May 20 15:46:40 ba-lnlm-rvp-nocodec-01 kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PC00.LPCB.H_EC.ECNT.PPOE], AE_NOT_FOUND (20230628/psargs-330)
May 20 15:46:40 ba-lnlm-rvp-nocodec-01 kernel: ACPI Error: Aborting method \_SB.PC00.LPCB.H_EC.ECNT due to previous error (AE_NOT_FOUND) (20230628/psparse-529)
May 20 15:46:40 ba-lnlm-rvp-nocodec-01 kernel: ACPI Error: Aborting method \_SB.PC00.LPCB.NTIR due to previous error (AE_NOT_FOUND) (20230628/psparse-529)
May 20 15:46:40 ba-lnlm-rvp-nocodec-01 kernel: ACPI Error: Aborting method \_SB.PEPD._DSM due to previous error (AE_NOT_FOUND) (20230628/psparse-529)
May 20 15:46:51 ba-lnlm-rvp-nocodec-01 kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PC00.LPCB.H_EC.ECNT.PPOE], AE_NOT_FOUND (20230628/psargs-330)
May 20 15:46:51 ba-lnlm-rvp-nocodec-01 kernel: ACPI Error: Aborting method \_SB.PC00.LPCB.H_EC.ECNT due to previous error (AE_NOT_FOUND) (20230628/psparse-529)
May 20 15:46:51 ba-lnlm-rvp-nocodec-01 kernel: ACPI Error: Aborting method \_SB.PC00.LPCB.NTIR due to previous error (AE_NOT_FOUND) (20230628/psparse-529)
May 20 15:46:51 ba-lnlm-rvp-nocodec-01 kernel: ACPI Error: Aborting method \_SB.PEPD._DSM due to previous error (AE_NOT_FOUND) (20230628/psparse-529)
May 20 15:47:04 ba-lnlm-rvp-nocodec-01 kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PC00.LPCB.H_EC.ECNT.PPOE], AE_NOT_FOUND (20230628/psargs-330)
May 20 15:47:04 ba-lnlm-rvp-nocodec-01 kernel: ACPI Error: Aborting method \_SB.PC00.LPCB.H_EC.ECNT due to previous error (AE_NOT_FOUND) (20230628/psparse-529)
May 20 15:47:04 ba-lnlm-rvp-nocodec-01 kernel: ACPI Error: Aborting method \_SB.PC00.LPCB.NTIR due to previous error (AE_NOT_FOUND) (20230628/psparse-529)
May 20 15:47:04 ba-lnlm-rvp-nocodec-01 kernel: ACPI Error: Aborting method \_SB.PEPD._DSM due to previous error (AE_NOT_FOUND) (20230628/psparse-529)
May 20 15:47:17 ba-lnlm-rvp-nocodec-01 kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PC00.LPCB.H_EC.ECNT.PPOE], AE_NOT_FOUND (20230628/psargs-330)
May 20 15:47:17 ba-lnlm-rvp-nocodec-01 kernel: ACPI Error: Aborting method \_SB.PC00.LPCB.H_EC.ECNT due to previous error (AE_NOT_FOUND) (20230628/psparse-529)
May 20 15:47:17 ba-lnlm-rvp-nocodec-01 kernel: ACPI Error: Aborting method \_SB.PC00.LPCB.NTIR due to previous error (AE_NOT_FOUND) (20230628/psparse-529)
May 20 15:47:17 ba-lnlm-rvp-nocodec-01 kernel: ACPI Error: Aborting method \_SB.PEPD._DSM due to previous error (AE_NOT_FOUND) (20230628/psparse-529)
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PC00.LPCB.H_EC.ECNT.PPOE], AE_NOT_FOUND (20230628/psargs-330)
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: ACPI Error: Aborting method \_SB.PC00.LPCB.H_EC.ECNT due to previous error (AE_NOT_FOUND) (20230628/psparse-529)
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: ACPI Error: Aborting method \_SB.PC00.LPCB.NTIR due to previous error (AE_NOT_FOUND) (20230628/psparse-529)
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: ACPI Error: Aborting method \_SB.PEPD._DSM due to previous error (AE_NOT_FOUND) (20230628/psparse-529)
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: rcu: INFO: rcu_preempt self-detected stall on CPU
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: rcu:         1-....: (20981 ticks this GP) idle=2714/1/0x4000000000000000 softirq=1101696/1101696 fqs=1989
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: rcu:         (t=21000 jiffies g=1105033 q=94 ncpus=8)
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 45s! [rtcwake:72543]
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 71s! [rtcwake:72543]
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: rcu: INFO: rcu_preempt self-detected stall on CPU
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: rcu:         1-....: (83985 ticks this GP) idle=2714/1/0x4000000000000000 softirq=1101696/1101696 fqs=17574
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: rcu:         (t=84004 jiffies g=1105033 q=138 ncpus=8)
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 104s! [rtcwake:72543]
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 130s! [rtcwake:72543]
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: rcu: INFO: rcu_preempt self-detected stall on CPU
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: rcu:         1-....: (146988 ticks this GP) idle=2714/1/0x4000000000000000 softirq=1101696/1101696 fqs=33316
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: rcu:         (t=147007 jiffies g=1105033 q=208 ncpus=8)
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 160s! [rtcwake:72543]
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 186s! [rtcwake:72543]
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: rcu: INFO: rcu_preempt self-detected stall on CPU
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: rcu:         1-....: (209992 ticks this GP) idle=2714/1/0x4000000000000000 softirq=1101696/1101696 fqs=49057
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: rcu:         (t=210011 jiffies g=1105033 q=239 ncpus=8)
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 220s! [rtcwake:72543]
May 20 15:52:21 ba-lnlm-rvp-nocodec-01 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 246s! [rtcwake:72543]
May 20 15:52:20 ba-lnlm-rvp-nocodec-01 systemd[1]: systemd-journald.service: Watchdog timeout (limit 3min)!
marc-hb commented 4 months ago

I have not seen this in ages. @ranj063 , @plbossart OK to close?

plbossart commented 4 months ago

agree, probably unrelated to audio. let's close