thesofproject / linux

Linux kernel source tree
Other
91 stars 133 forks source link

LNL/SDW_AOIC kernel oops on remove #4891

Closed plbossart closed 7 months ago

plbossart commented 8 months ago

that's a new one...

[  118.928143] usbcore: deregistering interface driver snd-usb-audio
[  119.473197] ------------[ cut here ]------------
[  119.473200] remove_proc_entry: removing non-empty directory 'irq/155', leaking at least 'AudioDSP'
[  119.473207] WARNING: CPU: 2 PID: 2253 at fs/proc/generic.c:717 remove_proc_entry+0x198/0x1b0
[  119.473213] Modules linked in: snd_sof_ipc_msg_injector snd_soc_sof_sdw snd_sof_probes snd_soc_intel_sof_maxim_common snd_soc_intel_hda_dsp_common snd_soc_dmic snd_sof_pci_intel_lnl(-) snd_sof_pci_intel_mtl snd_sof_intel_hda_generic soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda_common snd_sof_intel_hda_mlink snd_sof_intel_hda snd_soc_hdac_hda snd_hda_ext_core snd_hda_codec snd_hwdep snd_hda_core snd_sof_pci snd_sof_acpi_intel_bdw snd_sof_acpi_intel_byt snd_sof_intel_atom snd_sof_xtensa_dsp snd_soc_acpi_intel_match snd_soc_acpi snd_sof_acpi snd_sof snd_sof_utils snd_intel_dspcfg snd_intel_sdw_acpi snd_soc_es8326 snd_soc_es8316 snd_soc_max98390 snd_soc_max98373_i2c snd_soc_max98373_sdw snd_soc_max98373 snd_soc_max98363 snd_soc_max98357a snd_soc_ts3a227e snd_soc_max98090 snd_soc_rt5682_sdw snd_soc_rt5682_i2c snd_soc_rt5682 snd_soc_rt5677 snd_soc_rt5677_spi snd_soc_rt5670 snd_soc_rt5660 snd_soc_rt5651 snd_soc_rt5645 snd_soc_rt5640 snd_soc_rt1011 snd_soc_sdw_mockup snd_soc_rt1318_sdw
[  119.473251]  snd_soc_rt1316_sdw snd_soc_rt1308_sdw snd_soc_rt1308 snd_soc_rl6231 snd_soc_rt715_sdca snd_soc_rt715 snd_soc_rt712_sdca_dmic snd_soc_rt712_sdca snd_soc_rt711_sdca regmap_sdw_mbq snd_soc_rt711 snd_soc_rt700 regmap_sdw soundwire_bus snd_soc_rt298 snd_soc_rt286 snd_soc_rt274 snd_soc_rl6347a snd_soc_wm8804_i2c snd_soc_wm8804 snd_soc_pcm512x_i2c snd_soc_pcm512x snd_soc_da7219 snd_soc_da7213 snd_soc_core snd_compress regmap_i2c snd_pcm r8153_ecm cdc_ether usbnet snd_ctl_led ledtrig_audio squashfs r8152 snd_usbmidi_lib snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device wmi_bmof snd_timer x86_pkg_temp_thermal snd intel_powerclamp soundcore video wmi fuse drm efivarfs intel_lpss_pci xhci_pci intel_lpss xhci_hcd idma64 mfd_core [last unloaded: snd_sof_pci_intel_cnl]
[  119.473292] CPU: 2 PID: 2253 Comm: rmmod Not tainted 6.8.0-rc5-test-07166-gf3f97ec8d353 #27
[  119.473294] Hardware name:  <beep>
[  119.473296] RIP: 0010:remove_proc_entry+0x198/0x1b0
[  119.473298] Code: 95 48 85 c0 48 8d 90 38 ff ff ff 48 0f 45 c2 48 8b 93 b8 00 00 00 4c 8b 80 e0 00 00 00 48 8b 92 e0 00 00 00 e8 79 10 c9 ff 90 <0f> 0b 90 90 e9 41 ff ff ff e8 fa 44 c9 00 66 2e 0f 1f 84 00 00 00
[  119.473299] RSP: 0018:ffffbc2b813c3c38 EFLAGS: 00010282
[  119.473301] RAX: 0000000000000000 RBX: ffffa33a0a893300 RCX: 0000000000000027
[  119.473303] RDX: ffffa34160ddc8c8 RSI: 0000000000000001 RDI: ffffa34160ddc8c0
[  119.473304] RBP: ffffa33a0a8933c0 R08: 00000000ffffdfff R09: 0000000000000001
[  119.473305] R10: 00000000ffffdfff R11: ffffffff95e85200 R12: ffffa33a046f2500
[  119.473305] R13: ffffa33a23049380 R14: 0000000000000001 R15: ffffa33a088abe80
[  119.473306] FS:  00007ff680764c40(0000) GS:ffffa34160c00000(0000) knlGS:0000000000000000
[  119.473308] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  119.473309] CR2: 00007f46b801c0a8 CR3: 0000000165ae0004 CR4: 0000000000f70ef0
[  119.473310] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  119.473310] DR3: 0000000000000000 DR6: 00000000ffff07f0 DR7: 0000000000000400
[  119.473311] PKRU: 55555554
[  119.473312] Call Trace:
[  119.473314]  <TASK>
[  119.473317]  ? __warn+0x84/0x160
[  119.473320]  ? remove_proc_entry+0x198/0x1b0
[  119.473322]  ? report_bug+0x171/0x1a0
[  119.473326]  ? handle_bug+0x3b/0x70
[  119.473330]  ? exc_invalid_op+0x17/0x70
[  119.473332]  ? asm_exc_invalid_op+0x1a/0x20
[  119.473337]  ? remove_proc_entry+0x198/0x1b0
[  119.473339]  ? remove_proc_entry+0x197/0x1b0
[  119.473340]  unregister_irq_proc+0xf4/0x120
[  119.473345]  free_desc+0x3d/0xe0
[  119.473349]  irq_free_descs+0x49/0x70
[  119.473351]  msi_domain_free_locked.part.0+0x192/0x1c0
[  119.473355]  msi_domain_free_irqs_all_locked+0x6e/0x90
[  119.473358]  pci_free_msi_irqs+0x12/0x40
[  119.473361]  pci_disable_msi+0x4c/0x60
[  119.473363]  devres_release_all+0xb9/0x110
[  119.473368]  device_unbind_cleanup+0xe/0x70
[  119.473371]  device_release_driver_internal+0x1ce/0x210
[  119.473374]  driver_detach+0x4b/0x90
[  119.473376]  bus_remove_driver+0x70/0xf0
[  119.473379]  pci_unregister_driver+0x3f/0x90
[  119.473382]  __do_sys_delete_module+0x1e1/0x2c0
[  119.473388]  do_syscall_64+0xaf/0x1b0
[  119.473391]  entry_SYSCALL_64_after_hwframe+0x6f/0x77
[  119.473392] RIP: 0033:0x7ff68088caeb
[  119.473395] Code: 73 01 c3 48 8b 0d 45 33 0f 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa b8 b0 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 15 33 0f 00 f7 d8 64 89 01 48
[  119.473396] RSP: 002b:00007ffe94f9a0d8 EFLAGS: 00000206 ORIG_RAX: 00000000000000b0
[  119.473398] RAX: ffffffffffffffda RBX: 000055baa63f4790 RCX: 00007ff68088caeb
[  119.473399] RDX: 000000000000000a RSI: 0000000000000800 RDI: 000055baa63f47f8
[  119.473400] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
[  119.473400] R10: 00007ff680924ac0 R11: 0000000000000206 R12: 00007ffe94f9a330
[  119.473409] R13: 000055baa63f42a0 R14: 00007ffe94f9b603 R15: 000055baa63f4790
[  119.473414]  </TASK>
[  119.473414] ---[ end trace 0000000000000000 ]---
[  119.837155] soundwire_intel soundwire_intel.link.3: intel_link_power_up: hdac_bus_eml_sdw_power_up failed: -5
[  119.837162] soundwire_intel soundwire_intel.link.3: intel_resume_runtime: power_up failed after bus reset: -5
[  119.867736] soundwire_intel soundwire_intel.link.2: intel_link_power_up: hdac_bus_eml_sdw_power_up failed: -5
[  119.867746] soundwire_intel soundwire_intel.link.2: intel_resume_runtime: power_up failed after bus reset: -5
[  119.891417] soundwire_intel soundwire_intel.link.1: intel_link_power_up: hdac_bus_eml_sdw_power_up failed: -5
[  119.891425] soundwire_intel soundwire_intel.link.1: intel_resume_runtime: power_up failed after bus reset: -5
[  119.912123] soundwire_intel soundwire_intel.link.0: intel_link_power_up: hdac_bus_eml_sdw_power_up failed: -5
[  119.912132] soundwire_intel soundwire_intel.link.0: intel_resume_runtime: power_up failed after bus reset: -5
plbossart commented 8 months ago

If I do an insert after that, I get the same errors as #4889 ....

[  370.878813] snd_hda_core: unknown parameter 'hdac_i915_timeout_ms' ignored
[  370.887040] soundwire sdw-master-0-0: runtime PM trying to activate child device sdw-master-0-0 but parent (soundwire_intel.link.0) is not active
[  370.887079] soundwire_bus:sdw_extract_slave_id: soundwire sdw-master-0-0: SDW Slave Addr: 30025d071101
[  370.887081] soundwire_bus:sdw_extract_slave_id: soundwire sdw-master-0-0: SDW Slave class_id 0x01, mfg_id 0x025d, part_id 0x0711, unique_id 0x0, version 0x3
[  370.887307] snd_soc_rt711_sdca:rt711_sdca_init: rt711-sdca sdw:0:0:025d:0711:01: rt711_sdca_init
[  370.887450] soundwire_bus:sdw_drv_probe: rt711-sdca sdw:0:0:025d:0711:01: probe complete
[  370.887583] soundwire sdw-master-0-1: runtime PM trying to activate child device sdw-master-0-1 but parent (soundwire_intel.link.1) is not active
[  370.887606] soundwire_bus:sdw_extract_slave_id: soundwire sdw-master-0-1: SDW Slave Addr: 130025d071401
[  370.887608] soundwire_bus:sdw_extract_slave_id: soundwire sdw-master-0-1: SDW Slave class_id 0x01, mfg_id 0x025d, part_id 0x0714, unique_id 0x0, version 0x3
[  370.887940] snd_soc_rt715_sdca:rt715_sdca_init: rt715-sdca sdw:0:1:025d:0714:01: rt715_sdca_init
[  370.890815] soundwire_bus:sdw_drv_probe: rt715-sdca sdw:0:1:025d:0714:01: probe complete
[  370.890921] soundwire sdw-master-0-2: runtime PM trying to activate child device sdw-master-0-2 but parent (soundwire_intel.link.2) is not active
[  370.890946] soundwire_bus:sdw_extract_slave_id: soundwire sdw-master-0-2: SDW Slave Addr: 230025d131601
[  370.890948] soundwire_bus:sdw_extract_slave_id: soundwire sdw-master-0-2: SDW Slave class_id 0x01, mfg_id 0x025d, part_id 0x1316, unique_id 0x0, version 0x3
[  370.891685] snd_soc_rt1316_sdw:rt1316_sdw_init: rt1316-sdca sdw:0:2:025d:1316:01: rt1316_sdw_init
[  370.891688] snd_soc_rt1316_sdw:rt1316_read_prop: rt1316-sdca sdw:0:2:025d:1316:01: rt1316_read_prop
[  370.891899] soundwire_bus:sdw_drv_probe: rt1316-sdca sdw:0:2:025d:1316:01: probe complete
[  370.893518] soundwire sdw-master-0-3: runtime PM trying to activate child device sdw-master-0-3 but parent (soundwire_intel.link.3) is not active
[  370.893543] soundwire_bus:sdw_extract_slave_id: soundwire sdw-master-0-3: SDW Slave Addr: 331025d131601
[  370.893544] soundwire_bus:sdw_extract_slave_id: soundwire sdw-master-0-3: SDW Slave class_id 0x01, mfg_id 0x025d, part_id 0x1316, unique_id 0x1, version 0x3
[  370.893705] snd_soc_rt1316_sdw:rt1316_sdw_init: rt1316-sdca sdw:0:3:025d:1316:01: rt1316_sdw_init
[  370.893708] snd_soc_rt1316_sdw:rt1316_read_prop: rt1316-sdca sdw:0:3:025d:1316:01: rt1316_read_prop
[  370.893781] soundwire_bus:sdw_drv_probe: rt1316-sdca sdw:0:3:025d:1316:01: probe complete
[  370.983615] snd_sof_pci:sof_pci_probe: sof-audio-pci-intel-lnl 0000:00:1f.3: PCI DSP detected
[  370.983792] sof-audio-pci-intel-lnl 0000:00:1f.3: sof_debug value: 0x1
[  370.983800] sof-audio-pci-intel-lnl 0000:00:1f.3: DSP detected with PCI class/subclass/prog-if 0x040100
[  371.083001] sof-audio-pci-intel-lnl 0000:00:1f.3: error: failed to reset HDA controller gctl 0x1
[  371.083009] sof-audio-pci-intel-lnl 0000:00:1f.3: error: get caps error
[  371.083023] sof-audio-pci-intel-lnl: probe of 0000:00:1f.3 failed with error -5
[  371.159461] usbcore: registered new interface driver snd-usb-audio
plbossart commented 7 months ago

not reproduced or seen in CI/daily test. This happened during weird debug sessions, chances are the hardware was in a semi-comatose state. closing.