pop-os / pop

A project for managing all Pop!_OS sources
https://system76.com/pop

Fresh install (21.04 & 20.04) system freezes when connecting to usb-c monitor #1778

Open ksmithut opened 3 years ago

ksmithut commented 3 years ago

Distribution (run cat /etc/os-release):

NAME="Pop!_OS"
VERSION="21.04"
ID=pop
ID_LIKE="ubuntu debian"
PRETTY_NAME="Pop!_OS 21.04"
VERSION_ID="21.04"
HOME_URL="https://pop.system76.com"
SUPPORT_URL="https://support.system76.com"
BUG_REPORT_URL="https://github.com/pop-os/pop/issues"
PRIVACY_POLICY_URL="https://system76.com/privacy"
VERSION_CODENAME=hirsute
UBUNTU_CODENAME=hirsute
LOGO=distributor-logo-pop-os

(Running on oryp5)

Related Application and/or Package Version (run apt policy $PACKAGE_NAME):

Not sure which packages are involved here. I will update this if we figure out which ones may be affecting it.

Issue/Bug Description:

When connecting to my usb-c monitor, the system freezes. When booting, it lets me put in my decryption password, but freezes after it says cryptsetup: cryptdata: set up successfully.

Steps to reproduce (if you know):

I've reproduced this on several clean installs of 21.04 and 20.04 on my oryp5.

  1. Do clean install (I enabled encryption)
  2. Connect to the monitor using USB-C (mine is a Dell U3818DW; power delivery over USB-C is a feature of this monitor)
    • Sometimes it freezes right away and sometimes it doesn't. The monitor doesn't work in either case. If it hasn't frozen by the time I try to shut down, it freezes during shutdown.
  3. Reboot
  4. System freezes when booting (or after entering a successful decryption password)

Expected behavior:

To be able to use a monitor over usb-c.

Other Notes:

I found some related issues that seemed similar to mine, but some didn't mention freezing like mine, or froze for reasons besides plugging in a monitor.

https://github.com/pop-os/pop/issues/1758
https://github.com/pop-os/pop/issues/1750
https://github.com/pop-os/pop/issues/1747
https://github.com/pop-os/pop/issues/1641

If I were a betting man, I'd say it has something to do with the 5.11 kernel update that comes with the latest installers.

If there are instructions on downgrading to another kernel version for kernel noobies like me, I'd be happy to try that.

ksmithut commented 3 years ago

I looked at journalctl for clues as to what might have happened. Here are the events that happened right after I plugged in the monitor:

journalctl

```
Jul 05 11:20:13 pop-os system76-power[1017]: [INFO] HotPlugDetect 0
Jul 05 11:20:13 pop-os gnome-shell[2437]: gnome-shell-extension-system76-power: hotplug event detected
Jul 05 11:20:13 pop-os system76-power[1017]: [INFO] DBUS Received GetGraphics() method
Jul 05 11:20:14 pop-os kernel: usb 1-2: new high-speed USB device number 9 using xhci_hcd
Jul 05 11:20:14 pop-os kernel: usb 1-2: New USB device found, idVendor=0451, idProduct=8442, bcdDevice= 1.00
Jul 05 11:20:14 pop-os kernel: usb 1-2: New USB device strings: Mfr=0, Product=0, SerialNumber=1
Jul 05 11:20:14 pop-os kernel: usb 1-2: SerialNumber: 74050851CC9D
Jul 05 11:20:14 pop-os kernel: hub 1-2:1.0: USB hub found
Jul 05 11:20:14 pop-os kernel: hub 1-2:1.0: 6 ports detected
Jul 05 11:20:14 pop-os upowerd[1300]: treating change event as add on /sys/devices/pci0000:00/0000:00:14.0/usb1/1-2
Jul 05 11:20:14 pop-os kernel: usb 1-2.5: new high-speed USB device number 10 using xhci_hcd
Jul 05 11:20:15 pop-os mtp-probe[3296]: checking bus 1, device 10: "/sys/devices/pci0000:00/0000:00:14.0/usb1/1-2/1-2.5"
Jul 05 11:20:15 pop-os mtp-probe[3296]: bus: 1, device: 10 was not an MTP device
Jul 05 11:20:15 pop-os kernel: usb 1-2.5: New USB device found, idVendor=0451, idProduct=82ff, bcdDevice= 1.00
Jul 05 11:20:15 pop-os kernel: usb 1-2.5: New USB device strings: Mfr=0, Product=0, SerialNumber=1
Jul 05 11:20:15 pop-os kernel: usb 1-2.5: SerialNumber: 74050851CC9D
Jul 05 11:20:15 pop-os kernel: hid-generic 0003:0451:82FF.0006: hiddev1,hidraw5: USB HID v1.11 Device [HID 0451:82ff] on usb-0000:00:14.0-2.5/input0
Jul 05 11:20:15 pop-os mtp-probe[3301]: checking bus 1, device 10: "/sys/devices/pci0000:00/0000:00:14.0/usb1/1-2/1-2.5"
Jul 05 11:20:15 pop-os mtp-probe[3301]: bus: 1, device: 10 was not an MTP device
Jul 05 11:20:17 pop-os /usr/libexec/gdm-x-session[1999]: (EE) NVIDIA(GPU-0): WAIT (2, 8, 0x8000, 0x0000bd0c, 0x0000bd14)
Jul 05 11:20:17 pop-os kernel: BUG: kernel NULL pointer dereference, address: 0000000000000000
Jul 05 11:20:17 pop-os kernel: #PF: supervisor read access in kernel mode
Jul 05 11:20:17 pop-os kernel: #PF: error_code(0x0000) - not-present page
Jul 05 11:20:17 pop-os kernel: PGD 0 P4D 0
Jul 05 11:20:17 pop-os kernel: Oops: 0000 [#1] SMP PTI
Jul 05 11:20:17 pop-os kernel: CPU: 3 PID: 817 Comm: nv_queue Tainted: P W OE 5.11.0-7620-generic #21~1624379747~21.04~3abeff8-Ubuntu
Jul 05 11:20:17 pop-os kernel: Hardware name: System76 Oryx Pro/Oryx Pro, BIOS 1.07.09-1 06/13/2019
Jul 05 11:20:17 pop-os kernel: RIP: 0010:_nv032400rm+0x21c/0x500 [nvidia]
Jul 05 11:20:17 pop-os kernel: Code: 8b 87 e0 01 00 00 e8 23 bb 8d e7 45 85 ff 45 89 fc 0f 94 c0 41 f7 d4 41 83 e6 01 75 05 84 45 30 75 2c 48 8b 45 18 48 8b 4d 20 <44> 23 38 44 89 39 44 23 >
Jul 05 11:20:17 pop-os kernel: RSP: 0018:ffff964340e0bd80 EFLAGS: 00010246
Jul 05 11:20:17 pop-os kernel: RAX: 0000000000000000 RBX: 0000000000000004 RCX: 0000000000000000
Jul 05 11:20:17 pop-os kernel: RDX: 0000000000000007 RSI: ffff89ed14e20008 RDI: ffff89ed0b39c008
Jul 05 11:20:17 pop-os kernel: RBP: ffff89ed09605f60 R08: 0000000000000001 R09: ffff89ed09605e68
Jul 05 11:20:17 pop-os kernel: R10: ffff89ed0b39c008 R11: 0000000010100000 R12: 00000000fffffeff
Jul 05 11:20:17 pop-os kernel: R13: ffff89ed04610010 R14: 0000000000000000 R15: 0000000000000100
Jul 05 11:20:17 pop-os kernel: FS: 0000000000000000(0000) GS:ffff89fc304c0000(0000) knlGS:0000000000000000
Jul 05 11:20:17 pop-os kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 05 11:20:17 pop-os kernel: CR2: 0000000000000000 CR3: 00000001abe10006 CR4: 00000000003706e0
Jul 05 11:20:17 pop-os kernel: Call Trace:
Jul 05 11:20:17 pop-os kernel: ? _nv026322rm+0xce/0x760 [nvidia]
Jul 05 11:20:17 pop-os kernel: ? _nv026327rm+0x1b0/0x1d0 [nvidia]
Jul 05 11:20:17 pop-os kernel: ? rm_execute_work_item+0x108/0x120 [nvidia]
Jul 05 11:20:17 pop-os kernel: ? os_execute_work_item+0x4c/0x70 [nvidia]
Jul 05 11:20:17 pop-os kernel: ? _main_loop+0x8c/0x140 [nvidia]
Jul 05 11:20:17 pop-os kernel: ? kthread+0x12f/0x150
Jul 05 11:20:17 pop-os kernel: ? nvidia_modeset_resume+0x30/0x30 [nvidia]
Jul 05 11:20:17 pop-os kernel: ? __kthread_bind_mask+0x70/0x70
Jul 05 11:20:17 pop-os kernel: ? ret_from_fork+0x22/0x30
Jul 05 11:20:17 pop-os kernel: Modules linked in: rfcomm ccm cmac algif_hash algif_skcipher af_alg nvidia_uvm(POE) bnep x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm snd_sof_>
Jul 05 11:20:17 pop-os kernel: iwlwifi wmi_bmof btintel serio_raw videodev bluetooth snd nvidia(POE) mxm_wmi ee1004 joydev input_leds mc processor_thermal_mbox efi_pstore mei_me processor_>
Jul 05 11:20:17 pop-os kernel: xhci_pci_renesas wmi video pinctrl_cannonlake
Jul 05 11:20:17 pop-os kernel: CR2: 0000000000000000
Jul 05 11:20:17 pop-os kernel: ---[ end trace ae1af6e86b5f3a0e ]---
Jul 05 11:20:17 pop-os kernel: RIP: 0010:_nv032400rm+0x21c/0x500 [nvidia]
Jul 05 11:20:17 pop-os kernel: Code: 8b 87 e0 01 00 00 e8 23 bb 8d e7 45 85 ff 45 89 fc 0f 94 c0 41 f7 d4 41 83 e6 01 75 05 84 45 30 75 2c 48 8b 45 18 48 8b 4d 20 <44> 23 38 44 89 39 44 23 >
Jul 05 11:20:17 pop-os kernel: RSP: 0018:ffff964340e0bd80 EFLAGS: 00010246
Jul 05 11:20:17 pop-os kernel: RAX: 0000000000000000 RBX: 0000000000000004 RCX: 0000000000000000
Jul 05 11:20:17 pop-os kernel: RDX: 0000000000000007 RSI: ffff89ed14e20008 RDI: ffff89ed0b39c008
Jul 05 11:20:17 pop-os kernel: RBP: ffff89ed09605f60 R08: 0000000000000001 R09: ffff89ed09605e68
Jul 05 11:20:17 pop-os kernel: R10: ffff89ed0b39c008 R11: 0000000010100000 R12: 00000000fffffeff
Jul 05 11:20:17 pop-os kernel: R13: ffff89ed04610010 R14: 0000000000000000 R15: 0000000000000100
Jul 05 11:20:17 pop-os kernel: FS: 0000000000000000(0000) GS:ffff89fc304c0000(0000) knlGS:0000000000000000
Jul 05 11:20:17 pop-os kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 05 11:20:17 pop-os kernel: CR2: 0000000000000000 CR3: 0000000109eb0001 CR4: 00000000003706e0
Jul 05 11:20:24 pop-os kernel: Asynchronous wait on fence NVIDIA:nvidia.prime:3fe timed out (hint:intel_atomic_commit_ready [i915])
Jul 05 11:20:24 pop-os /usr/libexec/gdm-x-session[1999]: (EE) NVIDIA(GPU-0): WAIT (1, 8, 0x8000, 0x0000bd0c, 0x0000bd14)
Jul 05 11:20:27 pop-os /usr/libexec/gdm-x-session[1999]: (EE) NVIDIA(GPU-0): WAIT (2, 8, 0x8000, 0x0000bd0c, 0x0000bd1c)
```

There were a few lines in there marked in red:

Jul 05 11:20:17 pop-os kernel: BUG: kernel NULL pointer dereference, address: 0000000000000000
Jul 05 11:20:17 pop-os kernel: #PF: supervisor read access in kernel mode
Jul 05 11:20:17 pop-os kernel: #PF: error_code(0x0000) - not-present page
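For anyone who wants to pull the same lines out of a longer journal, here is a rough Python sketch. It assumes the journal has been saved as plain text in the default `journalctl` format shown above. The function name `summarize_oops`, the marker list, and the `SAMPLE` text are hypothetical helpers made up for illustration, not part of any Pop!_OS tooling. It collects the kernel lines that start with `BUG:`, `#PF:`, or `Oops:` and reads the symbol and module from the first `RIP:` line.

```python
import re

# Matches the "RIP: 0010:symbol+0xoff/0xlen [module]" line of a kernel oops.
# The "[module]" part is optional because built-in code has no module tag.
RIP_RE = re.compile(
    r"RIP: \w+:(?P<symbol>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+(?: \[(?P<module>\w+)\])?"
)

# Prefixes treated as crash markers (an assumption, not an exhaustive list).
MARKERS = ("BUG:", "Oops:", "#PF:")


def summarize_oops(journal_text):
    """Return (marker_lines, faulting_symbol, faulting_module)."""
    marker_lines = []
    symbol = module = None
    for line in journal_text.splitlines():
        # Only look at messages logged by the kernel.
        if " kernel: " not in line:
            continue
        message = line.split(" kernel: ", 1)[1]
        if message.startswith(MARKERS):
            marker_lines.append(message)
        match = RIP_RE.match(message)
        if match and symbol is None:
            # Keep the first RIP line; the oops repeats it after the trace.
            symbol, module = match.group("symbol"), match.group("module")
    return marker_lines, symbol, module


# A few lines copied from the log above, used as sample input.
SAMPLE = """\
Jul 05 11:20:17 pop-os kernel: BUG: kernel NULL pointer dereference, address: 0000000000000000
Jul 05 11:20:17 pop-os kernel: #PF: supervisor read access in kernel mode
Jul 05 11:20:17 pop-os kernel: #PF: error_code(0x0000) - not-present page
Jul 05 11:20:17 pop-os kernel: Oops: 0000 [#1] SMP PTI
Jul 05 11:20:17 pop-os kernel: RIP: 0010:_nv032400rm+0x21c/0x500 [nvidia]
"""

if __name__ == "__main__":
    lines, symbol, module = summarize_oops(SAMPLE)
    for entry in lines:
        print(entry)
    print("faulting symbol:", symbol, "module:", module)
```

To run it against a real log, the output of `journalctl -k -b` could be saved to a file and its contents passed in place of `SAMPLE`. On the lines from my log, the `RIP:` entry points at the `nvidia` module.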