M-Bab / linux-kernel-amdgpu-binaries

Kernel binaries (amd64) of amd-staging with DAL and latest security patches

System crash when connecting external display through HDMI | BUG: unable to handle kernel NULL pointer dereference at 0000000000000134 | IP: dce110_validate_global+0x3d/0x80 [amdgpu] #27

Closed by dualbus 6 years ago

dualbus commented 7 years ago

My system crashes every time I plug in an external display (LG TV) through HDMI.

Kernel:

dualbus@ubuntu:~$ uname -r
4.12.13+

Distro:

dualbus@ubuntu:~$ lsb_release -a
No LSB modules are available.
Distributor ID: Ubuntu
Description:    Ubuntu 17.04
Release:    17.04
Codename:   zesty

Deb:

dualbus@ubuntu:~$ dpkg -l linux-headers-\* linux-image-\* firmware-\* | grep '^ii'
ii  firmware-radeon-ucode               1.50                   all          Firmware binaries for modern AMD GPUs
ii  linux-headers-4.10.0-33             4.10.0-33.37           all          Header files related to Linux kernel version 4.10.0
ii  linux-headers-4.10.0-33-generic     4.10.0-33.37           amd64        Linux kernel headers for version 4.10.0 on 64 bit x86 SMP
ii  linux-headers-4.12.13+              17.09.15.AMDGPU.UBUNTU amd64        Header files related to Linux kernel, specifically,
ii  linux-headers-generic               4.10.0.33.33           amd64        Generic Linux kernel headers
ii  linux-image-4.10.0-33-generic       4.10.0-33.37           amd64        Linux kernel image for version 4.10.0 on 64 bit x86 SMP
ii  linux-image-4.12.13+                17.09.15.AMDGPU.UBUNTU amd64        Linux kernel binary image for version 4.12.13+
ii  linux-image-extra-4.10.0-33-generic 4.10.0-33.37           amd64        Linux kernel extra modules for version 4.10.0 on 64 bit x86 SMP
ii  linux-image-generic                 4.10.0.33.33           amd64        Generic Linux kernel image

PCI:

dualbus@ubuntu:~$ lspci -vvnn
00:00.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Device [1022:1576]
    Subsystem: Hewlett-Packard Company Device [103c:8246]
    Control: I/O- Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
    Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0

00:00.2 IOMMU [0806]: Advanced Micro Devices, Inc. [AMD] Device [1022:1577]
    Subsystem: Hewlett-Packard Company Device [103c:8246]
    Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Interrupt: pin A routed to IRQ 24
    Capabilities: <access denied>

00:01.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Carrizo [1002:9874] (rev c5) (prog-if 00 [VGA controller])
    Subsystem: Hewlett-Packard Company Carrizo [103c:8246]
    Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0
    Interrupt: pin A routed to IRQ 47
    Region 0: Memory at e0000000 (64-bit, prefetchable) [size=256M]
    Region 2: Memory at f0000000 (64-bit, prefetchable) [size=8M]
    Region 4: I/O ports at f000 [size=256]
    Region 5: Memory at ff700000 (32-bit, non-prefetchable) [size=256K]
    Expansion ROM at 000c0000 [disabled] [size=128K]
    Capabilities: <access denied>
    Kernel driver in use: amdgpu
    Kernel modules: amdgpu

00:01.1 Audio device [0403]: Advanced Micro Devices, Inc. [AMD/ATI] Kabini HDMI/DP Audio [1002:9840]
    Subsystem: Hewlett-Packard Company Kabini HDMI/DP Audio [103c:8246]
    Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0
    Interrupt: pin B routed to IRQ 49
    Region 0: Memory at ff764000 (64-bit, non-prefetchable) [size=16K]
    Capabilities: <access denied>
    Kernel driver in use: snd_hda_intel
    Kernel modules: snd_hda_intel

00:02.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Device [1022:157b]
    DeviceName:  Onboard IGD
    Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
    Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-

00:02.2 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Device [1022:157c] (prog-if 00 [Normal decode])
    Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0
    Interrupt: pin A routed to IRQ 25
    Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
    Memory behind bridge: ff600000-ff6fffff
    Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- <SERR- <PERR-
    BridgeCtl: Parity- SERR- NoISA- VGA- MAbort- >Reset- FastB2B-
        PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn-
    Capabilities: <access denied>
    Kernel driver in use: pcieport
    Kernel modules: shpchp

00:02.3 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Device [1022:157c] (prog-if 00 [Normal decode])
    Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0
    Interrupt: pin A routed to IRQ 25
    Bus: primary=00, secondary=02, subordinate=02, sec-latency=0
    I/O behind bridge: 0000e000-0000efff
    Memory behind bridge: ff500000-ff5fffff
    Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort+ <SERR- <PERR-
    BridgeCtl: Parity- SERR- NoISA- VGA- MAbort- >Reset- FastB2B-
        PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn-
    Capabilities: <access denied>
    Kernel driver in use: pcieport
    Kernel modules: shpchp

00:02.4 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Device [1022:157c] (prog-if 00 [Normal decode])
    Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0
    Interrupt: pin A routed to IRQ 26
    Bus: primary=00, secondary=03, subordinate=03, sec-latency=0
    I/O behind bridge: 0000d000-0000dfff
    Memory behind bridge: ff400000-ff4fffff
    Prefetchable memory behind bridge: 00000000f0900000-00000000f0afffff
    Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- <SERR- <PERR-
    BridgeCtl: Parity- SERR- NoISA- VGA- MAbort- >Reset- FastB2B-
        PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn-
    Capabilities: <access denied>
    Kernel driver in use: pcieport
    Kernel modules: shpchp

00:03.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Device [1022:157b]
    Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
    Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-

00:03.1 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Device [1022:157c] (prog-if 00 [Normal decode])
    Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0
    Interrupt: pin A routed to IRQ 28
    Bus: primary=00, secondary=04, subordinate=04, sec-latency=0
    I/O behind bridge: 00001000-00001fff
    Memory behind bridge: f0b00000-f0cfffff
    Prefetchable memory behind bridge: 00000000f0d00000-00000000f0efffff
    Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- <SERR- <PERR-
    BridgeCtl: Parity- SERR- NoISA- VGA- MAbort- >Reset- FastB2B-
        PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn-
    Capabilities: <access denied>
    Kernel driver in use: pcieport
    Kernel modules: shpchp

00:08.0 Encryption controller [1080]: Advanced Micro Devices, Inc. [AMD] Device [1022:1578]
    Subsystem: Hewlett-Packard Company Device [103c:8246]
    Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0
    Interrupt: pin A routed to IRQ 5
    Region 0: Memory at f0800000 (64-bit, prefetchable) [size=128K]
    Region 2: Memory at ff300000 (32-bit, non-prefetchable) [size=1M]
    Region 3: Memory at ff76f000 (32-bit, non-prefetchable) [size=4K]
    Region 5: Memory at ff76a000 (32-bit, non-prefetchable) [size=8K]
    Capabilities: <access denied>

00:09.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Device [1022:157d]
    Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
    Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-

00:09.2 Audio device [0403]: Advanced Micro Devices, Inc. [AMD] Device [1022:157a]
    Subsystem: Hewlett-Packard Company Device [103c:8246]
    Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0
    Interrupt: pin A routed to IRQ 50
    Region 0: Memory at ff760000 (32-bit, non-prefetchable) [size=16K]
    Capabilities: <access denied>
    Kernel driver in use: snd_hda_intel
    Kernel modules: snd_hda_intel

00:10.0 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD] FCH USB XHCI Controller [1022:7914] (rev 20) (prog-if 30 [XHCI])
    Subsystem: Hewlett-Packard Company FCH USB XHCI Controller [103c:8246]
    Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0, Cache Line Size: 64 bytes
    Interrupt: pin A routed to IRQ 18
    Region 0: Memory at ff768000 (64-bit, non-prefetchable) [size=8K]
    Capabilities: <access denied>
    Kernel driver in use: xhci_hcd

00:11.0 SATA controller [0106]: Advanced Micro Devices, Inc. [AMD] FCH SATA Controller [AHCI mode] [1022:7901] (rev 49) (prog-if 01 [AHCI 1.0])
    Subsystem: Hewlett-Packard Company FCH SATA Controller [AHCI mode] [103c:8246]
    Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
    Status: Cap+ 66MHz+ UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx+
    Latency: 32
    Interrupt: pin A routed to IRQ 38
    Region 0: I/O ports at f140 [size=8]
    Region 1: I/O ports at f130 [size=4]
    Region 2: I/O ports at f120 [size=8]
    Region 3: I/O ports at f110 [size=4]
    Region 4: I/O ports at f100 [size=16]
    Region 5: Memory at ff76d000 (32-bit, non-prefetchable) [size=1K]
    Capabilities: <access denied>
    Kernel driver in use: ahci
    Kernel modules: ahci

00:12.0 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD] FCH USB EHCI Controller [1022:7908] (rev 49) (prog-if 20 [EHCI])
    Subsystem: Hewlett-Packard Company FCH USB EHCI Controller [103c:8246]
    Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
    Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 32, Cache Line Size: 64 bytes
    Interrupt: pin A routed to IRQ 18
    Region 0: Memory at ff76c000 (32-bit, non-prefetchable) [size=256]
    Capabilities: <access denied>
    Kernel driver in use: ehci-pci

00:14.0 SMBus [0c05]: Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller [1022:790b] (rev 4a)
    Subsystem: Hewlett-Packard Company FCH SMBus Controller [103c:8246]
    Control: I/O+ Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
    Status: Cap- 66MHz+ UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Kernel driver in use: piix4_smbus
    Kernel modules: i2c_piix4, sp5100_tco

00:14.3 ISA bridge [0601]: Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge [1022:790e] (rev 11)
    Subsystem: Hewlett-Packard Company FCH LPC Bridge [103c:8246]
    Control: I/O+ Mem+ BusMaster+ SpecCycle+ MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
    Status: Cap- 66MHz+ UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0

00:18.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Device [1022:1570]
    Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-

00:18.1 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Device [1022:1571]
    Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
    Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-

00:18.2 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Device [1022:1572]
    Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
    Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-

00:18.3 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Device [1022:1573]
    Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Capabilities: <access denied>
    Kernel driver in use: k10temp
    Kernel modules: k10temp

00:18.4 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Device [1022:1574]
    Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
    Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Kernel driver in use: fam15h_power
    Kernel modules: fam15h_power

00:18.5 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Device [1022:1575]
    Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
    Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-

01:00.0 Unassigned class [ff00]: Realtek Semiconductor Co., Ltd. RTS5229 PCI Express Card Reader [10ec:5229] (rev 01)
    DeviceName: Realtek PCIE CardReader
    Subsystem: Hewlett-Packard Company RTS5229 PCI Express Card Reader [103c:8246]
    Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0
    Interrupt: pin A routed to IRQ 35
    Region 0: Memory at ff600000 (32-bit, non-prefetchable) [size=4K]
    Capabilities: <access denied>
    Kernel driver in use: rtsx_pci
    Kernel modules: rtsx_pci

02:00.0 Ethernet controller [0200]: Realtek Semiconductor Co., Ltd. RTL8101/2/6E PCI Express Fast/Gigabit Ethernet controller [10ec:8136] (rev 07)
    DeviceName: Realtek PCIe FE Family Controller
    Subsystem: Hewlett-Packard Company RTL8101/2/6E PCI Express Fast/Gigabit Ethernet controller [103c:8246]
    Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0, Cache Line Size: 64 bytes
    Interrupt: pin A routed to IRQ 37
    Region 0: I/O ports at e000 [size=256]
    Region 2: Memory at ff514000 (64-bit, prefetchable) [size=4K]
    Region 4: Memory at ff510000 (64-bit, prefetchable) [size=16K]
    Expansion ROM at ff500000 [disabled] [size=64K]
    Capabilities: <access denied>
    Kernel driver in use: r8169
    Kernel modules: r8169

03:00.0 Network controller [0280]: Realtek Semiconductor Co., Ltd. RTL8723BE PCIe Wireless Network Adapter [10ec:b723]
    DeviceName: Realtek RTL8723BE 802.11b/g/n Wi-Fi + BT 4.0 Combo Adapter
    Subsystem: Hewlett-Packard Company RTL8723BE PCIe Wireless Network Adapter [103c:81c1]
    Physical Slot: 0
    Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0
    Interrupt: pin A routed to IRQ 51
    Region 0: I/O ports at d000 [size=256]
    Region 2: Memory at ff400000 (64-bit, non-prefetchable) [size=16K]
    Capabilities: <access denied>
    Kernel driver in use: rtl8723be
    Kernel modules: rtl8723be

Kernel Oops report:

dualbus@ubuntu:~$ awk '/ BUG: /,/ end trace /' /var/log/syslog*  
Sep 17 13:40:39 ubuntu kernel: [  128.677047] BUG: unable to handle kernel NULL pointer dereference at 0000000000000134
Sep 17 13:40:39 ubuntu kernel: [  128.677196] IP: dce110_validate_global+0x3d/0x80 [amdgpu]
Sep 17 13:40:39 ubuntu kernel: [  128.677234] PGD 0 
Sep 17 13:40:39 ubuntu kernel: [  128.677236] P4D 0 
Sep 17 13:40:39 ubuntu kernel: [  128.677252] 
Sep 17 13:40:39 ubuntu kernel: [  128.677284] Oops: 0000 [#1] SMP
Sep 17 13:40:39 ubuntu kernel: [  128.677308] Modules linked in: hid_generic usbhid hid rfcomm cmac bnep nls_iso8859_1 snd_hda_codec_realtek snd_hda_codec_generic snd_hda_codec_hdmi snd_hda_intel snd_hda_codec snd_hda_core uvcvideo snd_hwdep videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 snd_pcm snd_seq_midi hp_wmi videobuf2_core sparse_keymap videodev media snd_seq_midi_event edac_mce_amd btusb btrtl arc4 snd_rawmidi btbcm kvm_amd btintel rtl8723be kvm btcoexist snd_seq bluetooth rtl8723_common rtl_pci rtlwifi mac80211 snd_seq_device cfg80211 snd_timer irqbypass joydev input_leds serio_raw fam15h_power k10temp snd rtsx_pci_ms ecdh_generic memstick soundcore i2c_piix4 tpm_crb shpchp i2c_designware_platform hp_accel lis3lv02d input_polldev i2c_designware_core i2c_scmi 8250_dw hp_wireless mac_hid parport_pc ppdev lp parport ip_tables
Sep 17 13:40:39 ubuntu kernel: [  128.677796]  x_tables autofs4 algif_skcipher af_alg dm_crypt amdkfd amd_iommu_v2 amdgpu crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc rtsx_pci_sdmmc aesni_intel aes_x86_64 i2c_algo_bit crypto_simd glue_helper ttm cryptd psmouse drm_kms_helper syscopyarea sysfillrect sysimgblt ahci fb_sys_fops libahci r8169 drm mii rtsx_pci wmi video
Sep 17 13:40:39 ubuntu kernel: [  128.678011] CPU: 2 PID: 1234 Comm: Xorg Not tainted 4.12.13+ #1
Sep 17 13:40:39 ubuntu kernel: [  128.678050] Hardware name: HP HP Pavilion Notebook/8246, BIOS F.05 06/02/2016
Sep 17 13:40:39 ubuntu kernel: [  128.678096] task: ffff96ffe2364240 task.stack: ffffae838345c000
Sep 17 13:40:39 ubuntu kernel: [  128.678189] RIP: 0010:dce110_validate_global+0x3d/0x80 [amdgpu]
Sep 17 13:40:39 ubuntu kernel: [  128.678228] RSP: 0018:ffffae838345faa8 EFLAGS: 00010297
Sep 17 13:40:39 ubuntu kernel: [  128.678264] RAX: ffff96ffe1da8070 RBX: ffff96ffe1da8000 RCX: 0000000000000000
Sep 17 13:40:39 ubuntu kernel: [  128.678310] RDX: 0000000000000008 RSI: ffff96ffe1da8000 RDI: ffff96ffe1da80a8
Sep 17 13:40:39 ubuntu kernel: [  128.678355] RBP: ffffae838345faa8 R08: 0000000000000000 R09: ffff96ffe0f5e600
Sep 17 13:40:39 ubuntu kernel: [  128.678400] R10: ffffae838345fa88 R11: 0000000000000000 R12: ffff96ffe30ff338
Sep 17 13:40:39 ubuntu kernel: [  128.678446] R13: 0000000000000000 R14: ffff96ffe39d5e80 R15: 0000000000000000
Sep 17 13:40:39 ubuntu kernel: [  128.678493] FS:  00007f0d5306ca40(0000) GS:ffff96fffed00000(0000) knlGS:0000000000000000
Sep 17 13:40:39 ubuntu kernel: [  128.678544] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Sep 17 13:40:39 ubuntu kernel: [  128.678582] CR2: 0000000000000134 CR3: 00000003dc8b9000 CR4: 00000000001406e0
Sep 17 13:40:39 ubuntu kernel: [  128.678627] Call Trace:
Sep 17 13:40:39 ubuntu kernel: [  128.678694]  dc_validate_global_state+0x3d/0x2f0 [amdgpu]
Sep 17 13:40:39 ubuntu kernel: [  128.678750]  ? drm_modeset_lock_all_ctx+0xa4/0xb0 [drm]
Sep 17 13:40:39 ubuntu kernel: [  128.678852]  amdgpu_dm_atomic_check+0x3cb/0x400 [amdgpu]
Sep 17 13:40:39 ubuntu kernel: [  128.678900]  drm_atomic_check_only+0x467/0x570 [drm]
Sep 17 13:40:39 ubuntu kernel: [  128.678944]  ? handle_conflicting_encoders+0x289/0x2a0 [drm_kms_helper]
Sep 17 13:40:39 ubuntu kernel: [  128.678999]  drm_atomic_commit+0x18/0x50 [drm]
Sep 17 13:40:39 ubuntu kernel: [  128.679035]  drm_atomic_helper_set_config+0x70/0xa0 [drm_kms_helper]
Sep 17 13:40:39 ubuntu kernel: [  128.679088]  __drm_mode_set_config_internal+0x65/0x110 [drm]
Sep 17 13:40:39 ubuntu kernel: [  128.679142]  drm_mode_setcrtc+0x4f1/0x660 [drm]
Sep 17 13:40:39 ubuntu kernel: [  128.679187]  drm_ioctl+0x216/0x4c0 [drm]
Sep 17 13:40:39 ubuntu kernel: [  128.679225]  ? drm_mode_getcrtc+0x180/0x180 [drm]
Sep 17 13:40:39 ubuntu kernel: [  128.679298]  amdgpu_drm_ioctl+0x4f/0x90 [amdgpu]
Sep 17 13:40:39 ubuntu kernel: [  128.679334]  do_vfs_ioctl+0xa3/0x610
Sep 17 13:40:39 ubuntu kernel: [  128.679361]  ? handle_mm_fault+0xd2/0x230
Sep 17 13:40:39 ubuntu kernel: [  128.679391]  ? __do_page_fault+0x266/0x4e0
Sep 17 13:40:39 ubuntu kernel: [  128.679421]  SyS_ioctl+0x79/0x90
Sep 17 13:40:39 ubuntu kernel: [  128.679446]  entry_SYSCALL_64_fastpath+0x1e/0xa9
Sep 17 13:40:39 ubuntu kernel: [  128.679478] RIP: 0033:0x7f0d50991587
Sep 17 13:40:39 ubuntu kernel: [  128.679503] RSP: 002b:00007fff6468de28 EFLAGS: 00003246 ORIG_RAX: 0000000000000010
Sep 17 13:40:39 ubuntu kernel: [  128.679552] RAX: ffffffffffffffda RBX: 0000563505270df0 RCX: 00007f0d50991587
Sep 17 13:40:39 ubuntu kernel: [  128.680735] RDX: 00007fff6468de60 RSI: 00000000c06864a2 RDI: 000000000000000d
Sep 17 13:40:39 ubuntu kernel: [  128.680735] RBP: 0000000000000001 R08: 0000000000000000 R09: 0000563505377d70
Sep 17 13:40:39 ubuntu kernel: [  128.680735] R10: 00007fff6468e050 R11: 0000000000003246 R12: 0000000000000556
Sep 17 13:40:39 ubuntu kernel: [  128.680735] R13: 0000000000000000 R14: 0000563505379e90 R15: 0000563505377610
Sep 17 13:40:39 ubuntu kernel: [  128.680735] Code: 74 5b 8d 4a ff 48 8d 46 38 48 8d 14 cd 00 00 00 00 48 29 ca 48 8d 7c d6 70 31 d2 8b 08 85 c9 74 2f 83 f9 02 7f 3e 48 8b 4c d6 40 <83> b9 34 01 00 00 0a 76 12 81 79 50 80 07 00 00 7f 27 81 79 54 
Sep 17 13:40:39 ubuntu kernel: [  128.680735] RIP: dce110_validate_global+0x3d/0x80 [amdgpu] RSP: ffffae838345faa8
Sep 17 13:40:39 ubuntu kernel: [  128.680735] CR2: 0000000000000134
Sep 17 13:40:39 ubuntu kernel: [  128.708072] ---[ end trace 1d874cedb9128be4 ]---
Sep 17 13:56:28 ubuntu kernel: [  306.631719] BUG: unable to handle kernel NULL pointer dereference at 0000000000000134
Sep 17 13:56:28 ubuntu kernel: [  306.631872] IP: dce110_validate_global+0x3d/0x80 [amdgpu]
Sep 17 13:56:28 ubuntu kernel: [  306.631910] PGD 0 
Sep 17 13:56:28 ubuntu kernel: [  306.631911] P4D 0 
Sep 17 13:56:28 ubuntu kernel: [  306.631928] 
Sep 17 13:56:28 ubuntu kernel: [  306.631957] Oops: 0000 [#1] SMP
Sep 17 13:56:28 ubuntu kernel: [  306.631981] Modules linked in: ccm rfcomm cmac bnep nls_iso8859_1 hp_wmi arc4 snd_hda_codec_realtek snd_hda_codec_hdmi snd_hda_codec_generic sparse_keymap rtl8723be snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep uvcvideo btcoexist rtl8723_common rtl_pci videobuf2_vmalloc edac_mce_amd snd_pcm btusb rtlwifi videobuf2_memops btrtl mac80211 kvm_amd snd_seq_midi snd_seq_midi_event snd_rawmidi btbcm btintel snd_seq input_leds videobuf2_v4l2 kvm videobuf2_core videodev bluetooth joydev media ecdh_generic snd_seq_device snd_timer serio_raw snd irqbypass cfg80211 k10temp fam15h_power soundcore rtsx_pci_ms memstick i2c_piix4 shpchp hp_accel i2c_designware_platform tpm_crb mac_hid i2c_scmi 8250_dw lis3lv02d input_polldev i2c_designware_core hp_wireless parport_pc ppdev lp parport ip_tables x_tables autofs4
Sep 17 13:56:28 ubuntu kernel: [  306.632525]  algif_skcipher af_alg dm_crypt hid_generic usbhid hid amdkfd amd_iommu_v2 amdgpu crct10dif_pclmul rtsx_pci_sdmmc crc32_pclmul ghash_clmulni_intel pcbc i2c_algo_bit ttm aesni_intel aes_x86_64 ahci crypto_simd glue_helper cryptd psmouse libahci drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops r8169 drm rtsx_pci mii wmi video
Sep 17 13:56:28 ubuntu kernel: [  306.632749] CPU: 1 PID: 1217 Comm: Xorg Not tainted 4.12.13+ #1
Sep 17 13:56:28 ubuntu kernel: [  306.632790] Hardware name: HP HP Pavilion Notebook/8246, BIOS F.05 06/02/2016
Sep 17 13:56:28 ubuntu kernel: [  306.632836] task: ffff8ac629f40000 task.stack: ffff9ae2c34e4000
Sep 17 13:56:28 ubuntu kernel: [  306.632923] RIP: 0010:dce110_validate_global+0x3d/0x80 [amdgpu]
Sep 17 13:56:28 ubuntu kernel: [  306.632963] RSP: 0018:ffff9ae2c34e7aa8 EFLAGS: 00010297
Sep 17 13:56:28 ubuntu kernel: [  306.632999] RAX: ffff8ac625518070 RBX: ffff8ac625518000 RCX: 0000000000000000
Sep 17 13:56:28 ubuntu kernel: [  306.633044] RDX: 0000000000000008 RSI: ffff8ac625518000 RDI: ffff8ac6255180a8
Sep 17 13:56:28 ubuntu kernel: [  306.633089] RBP: ffff9ae2c34e7aa8 R08: 0000000000000000 R09: ffff8ac627d3ec00
Sep 17 13:56:28 ubuntu kernel: [  306.633135] R10: ffff9ae2c34e7a88 R11: 0000000000000000 R12: ffff8ac623bb6b38
Sep 17 13:56:28 ubuntu kernel: [  306.633180] R13: 0000000000000000 R14: ffff8ac5ff377b80 R15: 0000000000000000
Sep 17 13:56:28 ubuntu kernel: [  306.633227] FS:  00007f4cc6178a40(0000) GS:ffff8ac63ec80000(0000) knlGS:0000000000000000
Sep 17 13:56:28 ubuntu kernel: [  306.633279] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Sep 17 13:56:28 ubuntu kernel: [  306.633316] CR2: 0000000000000134 CR3: 00000003e73da000 CR4: 00000000001406e0
Sep 17 13:56:28 ubuntu kernel: [  306.633362] Call Trace:
Sep 17 13:56:28 ubuntu kernel: [  306.633464]  dc_validate_global_state+0x3d/0x2f0 [amdgpu]
Sep 17 13:56:28 ubuntu kernel: [  306.633520]  ? drm_modeset_lock_all_ctx+0xa4/0xb0 [drm]
Sep 17 13:56:28 ubuntu kernel: [  306.633622]  amdgpu_dm_atomic_check+0x3cb/0x400 [amdgpu]
Sep 17 13:56:28 ubuntu kernel: [  306.633670]  drm_atomic_check_only+0x467/0x570 [drm]
Sep 17 13:56:28 ubuntu kernel: [  306.633715]  ? handle_conflicting_encoders+0x289/0x2a0 [drm_kms_helper]
Sep 17 13:56:28 ubuntu kernel: [  306.633775]  drm_atomic_commit+0x18/0x50 [drm]
Sep 17 13:56:28 ubuntu kernel: [  306.633815]  drm_atomic_helper_set_config+0x70/0xa0 [drm_kms_helper]
Sep 17 13:56:28 ubuntu kernel: [  306.633874]  __drm_mode_set_config_internal+0x65/0x110 [drm]
Sep 17 13:56:28 ubuntu kernel: [  306.633928]  drm_mode_setcrtc+0x4f1/0x660 [drm]
Sep 17 13:56:28 ubuntu kernel: [  306.633974]  drm_ioctl+0x216/0x4c0 [drm]
Sep 17 13:56:28 ubuntu kernel: [  306.634016]  ? drm_mode_getcrtc+0x180/0x180 [drm]
Sep 17 13:56:28 ubuntu kernel: [  306.634093]  amdgpu_drm_ioctl+0x4f/0x90 [amdgpu]
Sep 17 13:56:28 ubuntu kernel: [  306.634133]  do_vfs_ioctl+0xa3/0x610
Sep 17 13:56:28 ubuntu kernel: [  306.634163]  ? __sys_recvmsg+0x51/0x90
Sep 17 13:56:28 ubuntu kernel: [  306.634193]  ? __sys_recvmsg+0x80/0x90
Sep 17 13:56:28 ubuntu kernel: [  306.634195]  ? ktime_get_ts64+0x4e/0xf0
Sep 17 13:56:28 ubuntu kernel: [  306.634195]  SyS_ioctl+0x79/0x90
Sep 17 13:56:28 ubuntu kernel: [  306.634195]  entry_SYSCALL_64_fastpath+0x1e/0xa9
Sep 17 13:56:28 ubuntu kernel: [  306.634195] RIP: 0033:0x7f4cc3b49587
Sep 17 13:56:28 ubuntu kernel: [  306.634195] RSP: 002b:00007fffa54c5648 EFLAGS: 00003246 ORIG_RAX: 0000000000000010
Sep 17 13:56:28 ubuntu kernel: [  306.634195] RAX: ffffffffffffffda RBX: 0000556f9241ee90 RCX: 00007f4cc3b49587
Sep 17 13:56:28 ubuntu kernel: [  306.634195] RDX: 00007fffa54c5680 RSI: 00000000c06864a2 RDI: 000000000000000d
Sep 17 13:56:28 ubuntu kernel: [  306.634195] RBP: 0000556f9241c610 R08: 0000000000000000 R09: 0000556f92eb6540
Sep 17 13:56:28 ubuntu kernel: [  306.634195] R10: 00007fffa54c5870 R11: 0000000000003246 R12: 0000000000000002
Sep 17 13:56:28 ubuntu kernel: [  306.634195] R13: 0000000000000010 R14: 0000556f92315df0 R15: 0000000000000000
Sep 17 13:56:28 ubuntu kernel: [  306.634195] Code: 74 5b 8d 4a ff 48 8d 46 38 48 8d 14 cd 00 00 00 00 48 29 ca 48 8d 7c d6 70 31 d2 8b 08 85 c9 74 2f 83 f9 02 7f 3e 48 8b 4c d6 40 <83> b9 34 01 00 00 0a 76 12 81 79 50 80 07 00 00 7f 27 81 79 54 
Sep 17 13:56:28 ubuntu kernel: [  306.634195] RIP: dce110_validate_global+0x3d/0x80 [amdgpu] RSP: ffff9ae2c34e7aa8
Sep 17 13:56:28 ubuntu kernel: [  306.634195] CR2: 0000000000000134
Sep 17 13:56:28 ubuntu kernel: [  306.664221] ---[ end trace 72c97e0fdb474e94 ]---
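
Both reports above fault at the same place. As a rough triage aid (a sketch, not part of the original report), the helpers below condense each oops to its fault address and faulting symbol, and recompute the faulting address from the marked `<83> b9 34 01 00 00 0a` bytes of the `Code:` line. Those bytes decode to `cmpl $0xa,0x134(%rcx)`, and RCX is 0 in both dumps, so the access lands at 0x134, which matches CR2. The names `oops_summary` and `effective_addr` are hypothetical helpers, not existing tools; the kernel tree's `scripts/decodecode` can be used to disassemble the full `Code:` line.

```shell
#!/bin/sh
# Hypothetical helper: summarise each oops read from stdin as
#   <fault address> <symbol+offset/size> <module>
oops_summary() {
    awk '
        / BUG: unable to handle kernel / { addr = $NF }               # last field is the address
        / IP: /                          { print addr, $(NF-1), $NF } # symbol and [module]
    '
}

# Hypothetical helper: effective address of a base + disp32 memory operand.
# Args: base register value (hex, no 0x prefix), then the four displacement
# bytes in the order they appear in the Code: line (little-endian).
# Only meant for small base values such as the NULL base seen here.
effective_addr() {
    base=$1; b0=$2; b1=$3; b2=$4; b3=$5
    disp=$(( (0x$b3 << 24) | (0x$b2 << 16) | (0x$b1 << 8) | 0x$b0 ))
    printf '%016x\n' $(( 0x$base + disp ))
}
```

For example, `awk '/ BUG: /,/ end trace /' /var/log/syslog* | oops_summary` should print one line per oops; identical lines suggest the same bug was hit each time. `effective_addr 0000000000000000 34 01 00 00` prints `0000000000000134`.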

Entries before one of the stack traces:

Sep 17 13:40:39 ubuntu kernel: [  128.638162] [drm] [Detect]    [HDMIA][ConnIdx:1] LG TV: [Block 0] 00 FF FF FF FF FF FF 00 1E 6D 01 00 01 01 01 01 01 1A 01 03 80 A0 5A 78 0A EE 91 A3 54 4C 99 26 0F 50 54 A1 08 00 31 40 45 40 61 40 71 40 01 01 01 01 01 01 01 01 66 21 50 B0 51 00 1B 30 40 70 36 00 40 84 63 00 00 1E 64 19 00 40 41 00 26 30 18 88 36 00 40 84 63 00 00 1E 00 00 00 FD 00 3A 3E 1E 53 10 00 0A 20 20 20 20 20 20 00 00 00 FC 00 4C 47 20 54 56 0A 20 20 20 20 20 20 20 01 6D ^
Sep 17 13:40:39 ubuntu kernel: [  128.638195] [drm] [Detect]    [HDMIA][ConnIdx:1] LG TV: [Block 1] 02 03 1C F1 48 10 22 20 05 84 03 02 01 26 15 07 50 09 57 07 67 03 0C 00 20 00 80 1E 02 3A 80 18 71 38 2D 40 58 2C 04 05 40 84 63 00 00 1E 01 1D 80 18 71 1C 16 20 58 2C 25 00 40 84 63 00 00 9E 01 1D 00 72 51 D0 1E 20 6E 28 55 00 40 84 63 00 00 1E 8C 0A D0 8A 20 E0 2D 10 10 3E 96 00 40 84 63 00 00 18 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 86 ^
Sep 17 13:40:39 ubuntu kernel: [  128.638200] [drm] dc_link_detect: manufacturer_id = 6D1E, product_id = 1, serial_number = 1010101, manufacture_week = 1, manufacture_year = 26, display_name = LG TV, speaker_flag = 5, audio_mode_count = 2
Sep 17 13:40:39 ubuntu kernel: [  128.638202] [drm] dc_link_detect: mode number = 0, format_code = 2, channel_count = 5, sample_rate = 7, sample_size = 80
Sep 17 13:40:39 ubuntu kernel: [  128.638205] [drm] dc_link_detect: mode number = 1, format_code = 1, channel_count = 1, sample_rate = 87, sample_size = 7
Sep 17 13:40:39 ubuntu kernel: [  128.638208] [drm] link=1, dc_sink_in=ffff96ffdbdefc00 is now Connected
Sep 17 13:40:39 ubuntu kernel: [  128.638210] [drm] DCHPD: connector_id=1: Old sink=          (null) New sink=ffff96ffdbdefc00
Sep 17 13:40:39 ubuntu kernel: [  128.677047] BUG: unable to handle kernel NULL pointer dereference at 0000000000000134
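
The `manufacturer_id = 6D1E` and `manufacture_year = 26` values printed by `dc_link_detect` come from EDID bytes 8, 9 and 17 of Block 0 (`1E 6D` and `1A`). A minimal sketch, assuming the standard EDID layout (three 5-bit letters packed big-endian with 1 meaning 'A', and a year byte counted from 1990), decodes them. `edid_pnp_id` and `edid_year` are hypothetical helper names used only for illustration.

```shell
#!/bin/sh
# Hypothetical helper: decode the 3-letter PNP manufacturer ID.
# Args: EDID byte 8 and byte 9 as hex (the ID is stored big-endian).
edid_pnp_id() {
    word=$(( (0x$1 << 8) | 0x$2 ))
    for s in 10 5 0; do
        n=$(( (word >> s) & 0x1f ))                 # 1 = 'A' ... 26 = 'Z'
        printf "\\$(printf '%03o' $(( n + 64 )))"   # emit the ASCII letter
    done
    printf '\n'
}

# Hypothetical helper: manufacture year from EDID byte 17 (hex),
# stored as an offset from 1990.
edid_year() {
    echo $(( 1990 + 0x$1 ))
}
```

With the bytes from the log, `edid_pnp_id 1E 6D` prints `GSM` and `edid_year 1A` prints `2016`, which is consistent with the "LG TV" display name the driver reports.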

M-Bab commented 7 years ago

Okay,

Here is my setup: Ubuntu 17.04, Ubuntu kernel 4.12.13+, R9 380, Mesa Padoka stable. I connected my PC to an LG TV (I also have one) via HDMI: no crash. I switched HDMI audio to the TV and back: no crash.

So the issue is reproducible on your machine but does not affect every setup. It would be great if you could test:

  1. Other kernels (the vanilla variant or older versions like 4.11). Also try the Ubuntu official 4.12 vanilla kernel for cross-checking (http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.12.13/).
  2. Other mesa drivers from the PPAs.
  3. Have you installed the firmware package as well?

dualbus commented 7 years ago

  1. I'm currently testing the 4.9.51+ vanilla variant with great success. I've been using HDMI video and audio output to the TV with no crashes or visible issues for a few days.

  2. No. I have the following Mesa-related packages installed, which are the defaults for Ubuntu 17.04:

dualbus@ubuntu:~$ dpkg -l mesa\* | awk '$1 == "ii" {print $2 " = " $3}'
mesa-utils = 8.3.0-4
mesa-va-drivers:amd64 = 17.0.7-0ubuntu0.17.04.1
mesa-vdpau-drivers:amd64 = 17.0.7-0ubuntu0.17.04.1
  3. Yes.

dualbus@ubuntu:~$ dpkg -l firmware\* | awk '$1 == "ii" { print $2 " = " $3}'
firmware-radeon-ucode = 1.50

I'll keep this issue open while I try to identify the root cause of the crashing behavior in the 4.12.13+ Ubuntu kernel variant, if that's ok with you :)

M-Bab commented 7 years ago

Yeah, whenever you find something new, let me know in this thread. I think the comparison with the 4.12 vanilla kernel mentioned above is especially interesting, because it can show whether this is an amdgpu-related problem.

But as long as 4.9 is working for you, that is also a good way to go.