Closed skinkie closed 4 years ago
Would you mind attaching your dmesg from when that happened?
I don't know if it is this one:
[48198.201005] WARNING: CPU: 0 PID: 682 at drivers/gpu/drm/amd/amdgpu/../display/dc/dcn10/dcn10_hw_sequencer.c:971 dcn10_verify_allow_pstate_change_high+0x25/0x215 [amdgpu]
[48198.201007] Modules linked in: ctr ccm rfcomm cmac bnep joydev zfs(PO) zunicode(PO) zlua(PO) zavl(PO) icp(PO) wmi_bmof zcommon(PO) znvpair(PO) spl(O) zlib_deflate zlib_inflate amdgpu snd_hda_codec_conexant snd_hda_codec_generic ath10k_pci mfd_core btusb gpu_sched btrtl i2c_algo_bit btbcm ath10k_core snd_hda_codec_hdmi ttm btintel aesni_intel glue_helper ath crypto_simd snd_hda_intel sdhci_pci cryptd snd_intel_dspcfg bluetooth drm_kms_helper iosf_mbi snd_hda_codec wireguard(O) ecdh_generic mac80211 cqhci syscopyarea ecc snd_hda_core sysfillrect ip6_udp_tunnel sysimgblt r8169 fb_sys_fops udp_tunnel thinkpad_acpi cfg80211 snd_pcm sdhci nvram mmc_core snd_timer ledtrig_audio realtek libarc4 snd evdev soundcore libphy rfkill i2c_piix4 psmouse video wmi backlight ac battery i2c_scmi button sch_fq_codel drm i2c_core efivarfs
[48198.201020] CPU: 0 PID: 682 Comm: Xorg Tainted: P W O 5.5.8-gentoo #2
[48198.201021] Hardware name: LENOVO 20KU000NMH/20KU000NMH, BIOS R0UET76W (1.56 ) 11/05/2019
[48198.201047] RIP: 0010:dcn10_verify_allow_pstate_change_high+0x25/0x215 [amdgpu]
[48198.201048] Code: 41 5e 41 5f c3 0f 1f 44 00 00 55 53 48 89 fb 48 8b 87 20 03 00 00 48 8b b8 b0 01 00 00 e8 5c f3 00 00 84 c0 0f 85 ed 01 00 00 <0f> 0b 31 ed 80 bb ab 01 00 00 00 0f 84 dc 01 00 00 48 8b 83 20 03
[48198.201049] RSP: 0018:ffffc90000de7990 EFLAGS: 00010246
[48198.201050] RAX: 0000000000000000 RBX: ffff888433640000 RCX: 0000000000000007
[48198.201051] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff88844ec13b20
[48198.201051] RBP: ffff888433640000 R08: 0000000000000401 R09: ffffffff821e93e0
[48198.201052] R10: 0000000000000000 R11: ffffffff821e980a R12: 0000000000000000
[48198.201053] R13: ffff88824aa501b8 R14: ffff88824aa51d78 R15: ffff888344f5e600
[48198.201054] FS: 00007f2a689aa940(0000) GS:ffff88844ec00000(0000) knlGS:0000000000000000
[48198.201054] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[48198.201055] CR2: 0000000001a60000 CR3: 0000000428f78000 CR4: 00000000003406f0
[48198.201056] Call Trace:
[48198.201082] dcn10_pipe_control_lock+0x2a/0x69 [amdgpu]
[48198.201104] dc_stream_set_cursor_attributes+0xf9/0x104 [amdgpu]
[48198.201135] handle_cursor_update+0x1ee/0x28e [amdgpu]
[48198.201140] drm_atomic_helper_async_commit+0x62/0xb4 [drm_kms_helper]
[48198.201143] drm_atomic_helper_commit+0x32/0xfb [drm_kms_helper]
[48198.201146] drm_atomic_helper_update_plane+0xd2/0xec [drm_kms_helper]
[48198.201151] drm_mode_cursor_universal+0x19d/0x219 [drm]
[48198.201157] drm_mode_cursor_common+0x10a/0x1e7 [drm]
[48198.201161] ? drm_mode_setplane+0x218/0x218 [drm]
[48198.201166] drm_mode_cursor_ioctl+0x3f/0x59 [drm]
[48198.201170] drm_ioctl_kernel+0x92/0xe6 [drm]
[48198.201175] drm_ioctl+0x1fe/0x2d8 [drm]
[48198.201179] ? drm_mode_setplane+0x218/0x218 [drm]
[48198.201181] ? tracer_hardirqs_on+0xc/0x1c
[48198.201182] ? trace_hardirqs_on+0x2e/0x3a
[48198.201203] amdgpu_drm_ioctl+0x4a/0x76 [amdgpu]
[48198.201205] vfs_ioctl+0x1e/0x2b
[48198.201206] do_vfs_ioctl+0x573/0x59d
[48198.201208] ? do_sigaltstack.constprop.0+0x8e/0xe7
[48198.201209] ksys_ioctl+0x3c/0x5b
[48198.201210] __x64_sys_ioctl+0x16/0x19
[48198.201212] do_syscall_64+0x16b/0x197
[48198.201213] ? trace_hardirqs_off_caller+0x41/0x44
[48198.201214] ? trace_hardirqs_off_thunk+0x1a/0x2b
[48198.201215] entry_SYSCALL_64_after_hwframe+0x49/0xbe
[48198.201216] RIP: 0033:0x7f2a695a6737
[48198.201217] Code: 00 00 00 75 0c 48 c7 c0 ff ff ff ff 48 83 c4 18 c3 e8 1d c8 01 00 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 00 b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 29 e7 0c 00 f7 d8 64 89 01 48
[48198.201218] RSP: 002b:00007ffe88c4f408 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
[48198.201219] RAX: ffffffffffffffda RBX: 000000000000000a RCX: 00007f2a695a6737
[48198.201220] RDX: 00007ffe88c4f410 RSI: 00000000c01c64a3 RDI: 000000000000000a
[48198.201220] RBP: 00000000c01c64a3 R08: 0000000000000001 R09: 0000000000003fff
[48198.201221] R10: 0000000000000780 R11: 0000000000000246 R12: 00007ffe88c4f410
[48198.201222] R13: 000000000000037b R14: 0000000000000422 R15: 0000000000000422
[48198.201223] ---[ end trace 7afe5f2458805dfe ]---
[48198.201838] [drm] pstate TEST_DEBUG_DATA: 0x3FFE0000
Compiling with clang worked again. I'll run it again with gcc to be sure about the kernel trace.
Crash is consistent but nothing in dmesg.
I think it might have been a memory allocation issue with GCC (just a hunch - which was why I asked for the dmesg) you'll have seen OOM messages
No, OOM messages. 16GB of memory + swap.
If cc1 is segfaulting there should be something in your dmesg
@FireBurn sadly it isn't... maybe I should enable some logging?
If you use sytemd, it might be in journalctl
Do you have a working build now? If so are you happy for this to be closed?
Build working with clang.