ValveSoftware / steam-for-linux

Issue tracking for the Steam for Linux beta client

Screen freeze after closing client #8636

Open sardior opened 2 years ago

sardior commented 2 years ago

Your system information

Please describe your issue in as much detail as possible:

Every time I close the client, my screen freezes completely, to the point where I can't even switch TTYs and have to hard-reset my PC.

xbps-query -s nvidia | grep common
[*] nvidia-libs-515.48.07_1          NVIDIA drivers for linux - common libraries
[*] nvidia-libs-32bit-515.48.07_1    NVIDIA drivers for linux - common libraries (32bit)

This is probably NVIDIA driver related: it started happening after I did a system update. Steam itself looks like it exits completely:

Installing breakpad exception handler for appid(steam)/version(1654574690)
Installing breakpad exception handler for appid(steam)/version(1654574690)
Installing breakpad exception handler for appid(steam)/version(1654574690)

(steam:5448): GLib-GObject-CRITICAL **: g_object_unref: assertion `G_IS_OBJECT (object)' failed

(steam:5448): GLib-GObject-WARNING **: invalid (NULL) pointer instance

(steam:5448): GLib-GObject-CRITICAL **: g_signal_handlers_disconnect_matched: assertion `G_TYPE_CHECK_INSTANCE (instance)' failed

(steam:5448): GLib-GObject-CRITICAL **: g_object_unref: assertion `G_IS_OBJECT (object)' failed
[2022-06-18 09:12:05] Startup - updater built Jun  7 2022 03:18:19
[2022-06-18 09:12:05] Loading cached metrics from disk (/home/slawo/.local/share/Steam/package/steam_client_metrics.bin)
[2022-06-18 09:12:05] Using the following download hosts for Public, Realm steamglobal
[2022-06-18 09:12:05] 1. https://cdn.akamai.steamstatic.com, /client/, Realm 'steamglobal', weight was 100, source = 'update_hosts_cached.vdf'
[2022-06-18 09:12:05] 2. https://cdn.cloudflare.steamstatic.com, /client/, Realm 'steamglobal', weight was 100, source = 'update_hosts_cached.vdf'
[2022-06-18 09:12:05] 3. http://media.steampowered.com, /client/, Realm 'steamglobal', weight was 1, source = 'baked in'
[2022-06-18 09:12:05] Verifying installation...
[2022-06-18 09:12:05] Verification complete
[2022-06-18 09:18:48] Shutdown

I tried the workarounds suggested in #8135, as the issue seems similar, but they didn't help.

Steps for reproducing this issue:

  1. Close Steam or kill the Steam PID.
  2. Wait a few seconds.

kisak-valve commented 2 years ago

Hello @sardior, no OpenGL application including Steam should be capable of taking down the entire system. This reads like a video driver or kernel issue. If you have a second system, it might be interesting to check if you can SSH into the affected computer and check kernel logs.

sardior commented 2 years ago

Now that you mention it, the syslog is just fine, and one time I had music playing that didn't stop after the freeze.

Jun 18 09:18:48 [kernel] [  465.557541] general protection fault, probably for non-canonical address 0x6b6b6b6b00000008: 0000 [#1] SMP NOPTI
Jun 18 09:18:48 [kernel] [  465.557548] CPU: 15 PID: 5635 Comm: steamwebhelper Tainted: P           O      5.15.47_1 #1
Jun 18 09:18:48 [kernel] [  465.557553] Hardware name: Micro-Star International Co., Ltd. MS-7B09/X399 GAMING PRO CARBON AC (MS-7B09), BIOS 1.70 12/18/2017
Jun 18 09:18:48 [kernel] [  465.557555] RIP: 0010:_nv035597rm+0xb0/0xe0 [nvidia]
Jun 18 09:18:48 [kernel] [  465.557979] Code: 89 c2 48 89 ef 48 8d b1 50 01 00 00 4c 89 e9 e8 76 57 ff ff 66 0f 1f 44 00 00 48 89 ef e8 e8 57 ff ff 84 c0 74 8a 48 8b 75 00 <48> 39 5e 08 75 ea 4c 39 26 75 e5 49 8b 44 24 20 48 8d b8 50 01 00
Jun 18 09:18:48 [kernel] [  465.557983] RSP: 0018:ffffb49195b33b18 EFLAGS: 00010202
Jun 18 09:18:48 [kernel] [  465.557986] RAX: 0000000000000001 RBX: ffff90313af59030 RCX: ffff9030ab454980
Jun 18 09:18:48 [kernel] [  465.557989] RDX: 6b6b6b6b6b6b6b6b RSI: 6b6b6b6b00000000 RDI: ffff9030b5b92d00
Jun 18 09:18:48 [kernel] [  465.557990] RBP: ffff9030b5b92d00 R08: 0000000000000020 R09: ffff9030b5b92d08
Jun 18 09:18:48 [kernel] [  465.557992] R10: ffff9030b6c04008 R11: 0000000000000001 R12: ffff9030ebfd9358
Jun 18 09:18:48 [kernel] [  465.557994] R13: 6b6b6b6b00000000 R14: ffff9030b5b92d78 R15: ffff90313af59030
Jun 18 09:18:48 [kernel] [  465.557996] FS:  00007fadd4e84240(0000) GS:ffff90387edc0000(0000) knlGS:0000000000000000
Jun 18 09:18:48 [kernel] [  465.557999] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 18 09:18:48 [kernel] [  465.558001] CR2: 00007fe55bfde910 CR3: 00000008ac624000 CR4: 00000000003506e0
Jun 18 09:18:48 [kernel] [  465.558003] Call Trace:
Jun 18 09:18:48 [kernel] [  465.558006]  <TASK>
Jun 18 09:18:48 [kernel] [  465.558008]  ? _nv014044rm+0x31e/0x7e0 [nvidia]
Jun 18 09:18:48 [kernel] [  465.558424]  ? _nv035901rm+0xac/0xe0 [nvidia]
Jun 18 09:18:48 [kernel] [  465.558804]  ? _nv037373rm+0xac/0x140 [nvidia]
Jun 18 09:18:48 [kernel] [  465.559218]  ? _nv037372rm+0x2f7/0x4d0 [nvidia]
Jun 18 09:18:48 [kernel] [  465.559631]  ? _nv037367rm+0x60/0x70 [nvidia]
Jun 18 09:18:48 [kernel] [  465.560044]  ? _nv037368rm+0x7b/0xb0 [nvidia]
Jun 18 09:18:48 [kernel] [  465.560458]  ? _nv035811rm+0x40/0xe0 [nvidia]
Jun 18 09:18:48 [kernel] [  465.560839]  ? _nv000621rm+0x68/0x80 [nvidia]
Jun 18 09:18:48 [kernel] [  465.561196]  ? rm_cleanup_file_private+0xfa/0x180 [nvidia]
Jun 18 09:18:48 [kernel] [  465.561555]  ? nvidia_close+0x151/0x2c0 [nvidia]
Jun 18 09:18:48 [kernel] [  465.561893]  ? nvidia_frontend_close+0x2b/0x50 [nvidia]
Jun 18 09:18:48 [kernel] [  465.562235]  ? __fput+0x8c/0x240
Jun 18 09:18:48 [kernel] [  465.562239]  ? task_work_run+0x62/0x90
Jun 18 09:18:48 [kernel] [  465.562242]  ? do_exit+0x34d/0xaa0
Jun 18 09:18:48 [kernel] [  465.562246]  ? do_group_exit+0x33/0xa0
Jun 18 09:18:48 [kernel] [  465.562249]  ? __x64_sys_exit_group+0x14/0x20
Jun 18 09:18:48 [kernel] [  465.562251]  ? do_syscall_64+0x5c/0x80
Jun 18 09:18:48 [kernel] [  465.562256]  ? do_syscall_64+0x69/0x80
Jun 18 09:18:48 [kernel] [  465.562259]  ? exit_to_user_mode_prepare+0x32/0x150
Jun 18 09:18:48 [kernel] [  465.562263]  ? syscall_exit_to_user_mode+0x23/0x40
Jun 18 09:18:48 [kernel] [  465.562266]  ? __x64_sys_close+0xe/0x40
Jun 18 09:18:48 [kernel] [  465.562270]  ? do_syscall_64+0x69/0x80
Jun 18 09:18:48 [kernel] [  465.562273]  ? syscall_exit_to_user_mode+0x23/0x40
Jun 18 09:18:48 [kernel] [  465.562275]  ? do_syscall_64+0x69/0x80
Jun 18 09:18:48 [kernel] [  465.562278]  ? do_syscall_64+0x69/0x80
Jun 18 09:18:48 [kernel] [  465.562281]  ? asm_exc_page_fault+0x8/0x30
Jun 18 09:18:48 [kernel] [  465.562285]  ? entry_SYSCALL_64_after_hwframe+0x44/0xae
Jun 18 09:18:48 [kernel] [  465.562288]  </TASK>
Jun 18 09:18:48 [kernel] [  465.562290] Modules linked in: rfcomm rpcsec_gss_krb5 auth_rpcgss nfsv4 xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp ip6table_mangle ip6table_nat ip6table_filter ip6_tables iptable_mangle iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_filter ip_tables x_tables bridge cmac algif_hash algif_skcipher af_alg bnep cfg80211 nfs lockd grace 8021q garp mrp stp llc sunrpc fscache nls_iso8859_1 nls_cp437 vfat fat snd_seq_midi snd_seq_midi_event nvidia_drm(PO) nvidia_modeset(PO) intel_rapl_msr nvidia(PO) intel_rapl_common edac_mce_amd snd_hda_codec_realtek kvm_amd snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi drm_kms_helper kvm cec snd_hda_intel irqbypass snd_intel_dspcfg rc_core snd_intel_sdw_acpi rapl wmi_bmof pcspkr drm snd_hda_codec snd_usb_audio igb btusb snd_hda_core snd_usbmidi_lib btrtl agpgart sp5100_tco btbcm i2c_algo_bit syscopyarea hid_logitech_hidpp joydev input_leds sysfillrect snd_rawmidi btintel snd_hwdep k10temp
Jun 18 09:18:48 [kernel] [  465.562424] ---[ end trace 94012ae261baa2f9 ]---
Jun 18 09:18:48 [kernel] [  465.562427] RIP: 0010:_nv035597rm+0xb0/0xe0 [nvidia]
Jun 18 09:18:48 [kernel] [  465.562843] Code: 89 c2 48 89 ef 48 8d b1 50 01 00 00 4c 89 e9 e8 76 57 ff ff 66 0f 1f 44 00 00 48 89 ef e8 e8 57 ff ff 84 c0 74 8a 48 8b 75 00 <48> 39 5e 08 75 ea 4c 39 26 75 e5 49 8b 44 24 20 48 8d b8 50 01 00
Jun 18 09:18:48 [kernel] [  465.562847] RSP: 0018:ffffb49195b33b18 EFLAGS: 00010202
Jun 18 09:18:48 [kernel] [  465.562850] RAX: 0000000000000001 RBX: ffff90313af59030 RCX: ffff9030ab454980
Jun 18 09:18:48 [kernel] [  465.562852] RDX: 6b6b6b6b6b6b6b6b RSI: 6b6b6b6b00000000 RDI: ffff9030b5b92d00
Jun 18 09:18:48 [kernel] [  465.562854] RBP: ffff9030b5b92d00 R08: 0000000000000020 R09: ffff9030b5b92d08
Jun 18 09:18:48 [kernel] [  465.562856] R10: ffff9030b6c04008 R11: 0000000000000001 R12: ffff9030ebfd9358
Jun 18 09:18:48 [kernel] [  465.562858] R13: 6b6b6b6b00000000 R14: ffff9030b5b92d78 R15: ffff90313af59030
Jun 18 09:18:48 [kernel] [  465.562861] FS:  00007fadd4e84240(0000) GS:ffff90387edc0000(0000) knlGS:0000000000000000
Jun 18 09:18:48 [kernel] [  465.562863] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 18 09:18:48 [kernel] [  465.562866] CR2: 00007fe55bfde910 CR3: 00000008ac624000 CR4: 00000000003506e0
Jun 18 09:18:48 [kernel] [  465.562869] Fixing recursive fault but reboot is needed!

Looks like an NVIDIA driver crash triggered by steamwebhelper. I don't really understand much else :smile:
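
For anyone reading a trace like the one above, most of what matters is on two lines: the `CPU: ... Comm: ...` line names the process and the kernel's taint flags, and the `RIP:` line names the faulting symbol and module. The sketch below pulls those fields out of saved log lines. `summarize_oops` and the regexes are hypothetical helpers written for this log format, not part of any existing tool, and the taint table covers only the two letters that appear here (`P` and `O`, as described in the kernel's tainted-kernels documentation).

```python
import re

# Only the two taint letters seen in the log above; any other letter is
# reported as "unknown flag" rather than guessed at.
TAINT_FLAGS = {
    "P": "proprietary module was loaded",
    "O": "out-of-tree module was loaded",
}

# Matches e.g. "CPU: 15 PID: 5635 Comm: steamwebhelper Tainted: P   O   5.15.47_1 #1"
CPU_LINE = re.compile(
    r"CPU: (?P<cpu>\d+) PID: (?P<pid>\d+) Comm: (?P<comm>\S+) "
    r"Tainted: (?P<taint>[A-Z ]+?)\s+(?P<kernel>\d\S*)"
)

# Matches e.g. "RIP: 0010:_nv035597rm+0xb0/0xe0 [nvidia]"
RIP_LINE = re.compile(
    r"RIP: [0-9a-f]+:(?P<symbol>\S+?)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?: \[(?P<module>\w+)\])?"
)


def summarize_oops(lines):
    """Return the first process/taint and RIP details found in the lines."""
    summary = {}
    for line in lines:
        cpu = CPU_LINE.search(line)
        if cpu and "comm" not in summary:
            summary["comm"] = cpu["comm"]
            summary["pid"] = int(cpu["pid"])
            summary["kernel"] = cpu["kernel"]
            summary["taint"] = {
                flag: TAINT_FLAGS.get(flag, "unknown flag")
                for flag in cpu["taint"]
                if flag != " "
            }
        rip = RIP_LINE.search(line)
        if rip and "symbol" not in summary:
            summary["symbol"] = rip["symbol"]
            summary["module"] = rip["module"]  # None if no [module] suffix
    return summary
```

Fed the trace above, this would report `steamwebhelper` as the process and `_nv035597rm` in the `nvidia` module as the faulting location, which fits the reading that the fault is inside the driver rather than in Steam itself.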

sardior commented 2 years ago

In today's logs I have 6 crashes like this: 4 of them start with "steamwebhelper Tainted" and 2 with "Xorg Tainted". The Xorg ones look similar, with a shorter stack trace.
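
A tally like the one above can be made by counting `Comm:` lines per process name. This is a minimal sketch that assumes each crash logs exactly one `Comm: <name> Tainted` line, as the trace earlier in this thread does; `crashes_by_process` is a hypothetical helper name.

```python
import re
from collections import Counter

# Captures the process name from lines like
# "... CPU: 15 PID: 5635 Comm: steamwebhelper Tainted: P  O  5.15.47_1 #1"
COMM = re.compile(r"Comm: (\S+) Tainted")


def crashes_by_process(lines):
    """Count kernel oops header lines per crashing process name."""
    counts = Counter()
    for line in lines:
        match = COMM.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts
```

Any iterable of lines works as input, so an open file handle for a saved copy of the kernel log can be passed directly.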