Open fredoh9 opened 4 months ago
dmesg is not available @fredoh9 ?
right, dmesg was not captured in this case. I'm looking at the issue logs are not collected in some failure cases also.
I will use journalctl and upload equivalent dmesg when I have the access to the device.
dmesg for the failure boot from ba-lnlm-rvp-hda-02. dmesg_20240720_stress_test.txt
Jul 20 16:21:58.904948 kernel: snd_sof_intel_hda_common:hda_dsp_state_log: sof-audio-pci-intel-lnl 0000:00:1f.3: Current DSP power state: D3
Jul 20 16:21:58.905276 kernel: snd_sof:sof_set_fw_state: sof-audio-pci-intel-lnl 0000:00:1f.3: fw_state change: 7 -> 0
Jul 20 16:21:58.905612 kernel: ACPI: EC: interrupt blocked
Jul 20 16:21:58.905671 kernel: ACPI: EC: interrupt unblocked
Jul 20 16:21:58.905713 kernel: BUG: workqueue lockup - pool cpus=0-7 node=0 flags=0x4 nice=0 stuck for 54s!
Jul 20 16:21:58.905922 kernel: Showing busy workqueues and worker pools:
Jul 20 16:21:58.952292 kernel: workqueue events: flags=0x0
Jul 20 16:21:58.952375 kernel: pwq 2: cpus=0 node=0 flags=0x0 nice=0 active=1 refcnt=2
Jul 20 16:21:58.952417 kernel: pending: vmstat_shepherd
Jul 20 16:21:58.952457 kernel: workqueue events_unbound: flags=0x2
Jul 20 16:21:58.952496 kernel: pwq 33: cpus=0-7 node=0 flags=0x4 nice=0 active=1 refcnt=2
Jul 20 16:21:58.952542 kernel: pending: crng_reseed
Found this error from weekend stress test in LNLM_RVP_HDA.
The command to reproduce: TPLG=/lib/firmware/intel/sof-ipc4-tplg/sof-hda-generic-ace1-4ch.tplg MODEL=LNLM_RVP_HDA SOF_TEST_INTERVAL=5 ~/sof-test/test-case/check-suspend-resume-with-audio.sh -l 100 -m playback
The error message is this,
Input/output error already happened in 4th loop
kernel lockup happened in 25th loop
I don't remember this error lately, reproduction rate will be very low.
Intel Internal test result link: planresultdetail/44120?model=LNLM_RVP_HDA&testcase=check-suspend-resume-with-playback-100