flathub / com.slack.Slack

https://flathub.org/apps/details/com.slack.Slack

[archlinux/x64/amdgpu] Freeze after a few minutes in screen sharing "huddles" #190

Closed: enote-kane closed 1 year ago

enote-kane commented 1 year ago

I can reproducibly trigger the following freeze, both with and without hardware acceleration:

  1. Start Slack.
  2. Join a huddle.
  3. Wait for someone to share their screen.
  4. After a few minutes of inactivity (no mouse movement), X freezes: the mouse pointer can still be moved, the connection stays active, and audio (including the mic) is unaffected.

Found the following in dmesg:

[16097.646270] amdgpu 0000:04:00.0: [drm] *ERROR* [CRTC:67:crtc-0] flip_done timed out
[16369.219599] amdgpu 0000:04:00.0: [drm] *ERROR* flip_done timed out
[16369.219608] amdgpu 0000:04:00.0: [drm] *ERROR* [CRTC:67:crtc-0] commit wait timed out
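
For anyone who wants to check their own kernel log for the same pattern, here is a minimal sketch that picks out amdgpu DRM timeout lines like the three above. The function name `find_drm_timeouts` and the regular expression are illustrative assumptions based only on the line format shown in this report; other kernel versions may word these messages differently.

```python
import re

# Assumed line shape, taken from the dmesg excerpt in this report:
# [16097.646270] amdgpu 0000:04:00.0: [drm] *ERROR* [CRTC:67:crtc-0] flip_done timed out
DRM_TIMEOUT = re.compile(
    r"^\[\s*(?P<ts>\d+\.\d+)\]\s+amdgpu\s+(?P<dev>\S+):\s+\[drm\]\s+\*ERROR\*\s+"
    r"(?:\[(?P<crtc>CRTC:[^\]]+)\]\s+)?(?P<msg>.*timed out)\s*$"
)

# The three lines quoted above, used as sample input.
SAMPLE = """\
[16097.646270] amdgpu 0000:04:00.0: [drm] *ERROR* [CRTC:67:crtc-0] flip_done timed out
[16369.219599] amdgpu 0000:04:00.0: [drm] *ERROR* flip_done timed out
[16369.219608] amdgpu 0000:04:00.0: [drm] *ERROR* [CRTC:67:crtc-0] commit wait timed out
"""


def find_drm_timeouts(log_text):
    """Return (timestamp, pci_device, crtc_or_None, message) for each timeout line."""
    events = []
    for line in log_text.splitlines():
        match = DRM_TIMEOUT.match(line.strip())
        if match:
            events.append(
                (
                    float(match.group("ts")),
                    match.group("dev"),
                    match.group("crtc"),  # None when the line carries no CRTC tag
                    match.group("msg"),
                )
            )
    return events


if __name__ == "__main__":
    for event in find_drm_timeouts(SAMPLE):
        print(event)
```

The same function can be fed the text of `dmesg` or `journalctl -k`; lines that do not match the assumed shape are simply skipped.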

The zypak-sandbox process then goes into Zombie state:

kane       37313  0.0  0.0   2720  1664 ?        S    11:54   0:00  |   \_ bwrap --args 41 slack --proxy-server=http://127.0.0.1:8118
kane       37333  0.0  0.0   2720  1152 ?        S    11:54   0:00  |       \_ bwrap --args 41 slack --proxy-server=http://127.0.0.1:8118
kane       37334  2.6  0.6 1175690228 175008 ?   Sl   11:54   1:04  |           \_ /app/extra/lib/slack/slack -s --enable-features=WebRTCPipeWireCapturer --proxy-server=http://12
kane       37337  0.0  0.0   5760  1536 ?        S    11:54   0:00  |           |   \_ cat
kane       37338  0.0  0.0   5760  1536 ?        S    11:54   0:00  |           |   \_ cat
kane       37348  0.0  0.1 33775708 49024 ?      S    11:54   0:00  |           |   \_ /app/extra/lib/slack/slack --type=zygote --no-zygote-sandbox --enable-crashpad --enable-cra
kane       37397 20.6  0.6 34227632 174276 ?     Sl   11:54   8:18  |           |   |   \_ /app/extra/lib/slack/slack --type=gpu-process --enable-logging --enable-crashpad --cras
kane       37350  0.0  0.0      0     0 ?        Z    11:54   0:00  |           |   \_ [zypak-sandbox] <defunct>
kane       37419  1.9  0.2 33842084 69488 ?      Sl   11:54   0:47  |           |   \_ /app/extra/lib/slack/slack --type=utility --utility-sub-type=network.mojom.NetworkService -
kane       37681  5.2  0.2 34040988 63572 ?      Sl   11:54   2:05  |           |   \_ /app/extra/lib/slack/slack --type=utility --utility-sub-type=audio.mojom.AudioService --lan
kane       37361  0.0  0.0   2720  1280 ?        S    11:54   0:00  |           \_ bwrap --args 40 /app/bin/zypak-helper child - /app/extra/lib/slack/slack --type=zygote --enable
kane       37362  0.0  0.1 33778080 49664 ?      S    11:54   0:00  |           |   \_ /app/extra/lib/slack/slack --type=zygote --enable-crashpad --enable-crashpad
kane       37465 54.2  1.5 1184640496 428000 ?   Sl   11:54  21:47  |           |       \_ /app/extra/lib/slack/slack --type=renderer --enable-crashpad --crashpad-handler-pid=35 
kane       37381  0.0  0.0 33591376 7424 ?       Sl   11:54   0:00  |           \_ /app/extra/lib/slack/chrome_crashpad_handler --monitor-self-annotation=ptype=crashpad-handler -
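
To spot the defunct entry in a long process listing like the one above, a small sketch along these lines may help. It assumes the `ps aux` column layout (USER, PID, %CPU, %MEM, VSZ, RSS, TTY, STAT, START, TIME, COMMAND); the helper name `find_zombies` is hypothetical, and the sample is two lines copied from the listing.

```python
# Two lines copied from the listing above; a raw string keeps the "\_" tree markers intact.
SAMPLE = r"""
kane       37337  0.0  0.0   5760  1536 ?        S    11:54   0:00  |           |   \_ cat
kane       37350  0.0  0.0      0     0 ?        Z    11:54   0:00  |           |   \_ [zypak-sandbox] <defunct>
"""


def find_zombies(ps_output):
    """Return (pid, command) for every process whose STAT column starts with 'Z'."""
    zombies = []
    for line in ps_output.splitlines():
        # Split into at most 11 fields so the command (with its spaces) stays in one piece.
        fields = line.split(None, 10)
        if len(fields) < 11 or not fields[1].isdigit():
            continue  # header, blank, or otherwise unexpected line
        if fields[7].startswith("Z"):
            zombies.append((int(fields[1]), fields[10]))
    return zombies


if __name__ == "__main__":
    for pid, command in find_zombies(SAMPLE):
        print(pid, command)
```

A zombie has already exited and is only waiting for its parent to reap it, which is why its VSZ and RSS columns read 0.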

...and dmesg shows the following when trying to kill X:

[15333.120532] ------------[ cut here ]------------
[15333.120532] WARNING: CPU: 15 PID: 35582 at drivers/gpu/drm/amd/amdgpu/../display/amdgpu_dm/amdgpu_dm.c:7500 amdgpu_dm_atomic_commit_tail+0x2c8b/0x2cf0 [amdgpu]
[15333.120705] Modules linked in: snd_seq_dummy snd_hrtimer snd_seq snd_seq_device xt_nat xt_tcpudp veth xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c xt_addrtype iptable_filter br_netfilter bridge stp llc qrtr_mhi overlay vfat fat snd_soc_acp6x_mach snd_soc_dmic snd_acp6x_pdm_dma snd_sof_amd_rembrandt snd_sof_amd_renoir snd_sof_amd_acp snd_sof_pci snd_sof_xtensa_dsp qrtr snd_sof ath11k_pci snd_sof_utils snd_soc_core ath11k snd_compress ac97_bus snd_ctl_led intel_rapl_msr qmi_helpers snd_pcm_dmaengine intel_rapl_common snd_hda_codec_realtek btusb snd_hda_codec_generic snd_hda_codec_hdmi snd_pci_ps edac_mce_amd uvcvideo btrtl snd_rpl_pci_acp6x snd_hda_intel mac80211 btbcm snd_acp_pci videobuf2_vmalloc snd_intel_dspcfg snd_intel_sdw_acpi snd_pci_acp6x videobuf2_memops btintel kvm_amd snd_hda_codec videobuf2_v4l2 snd_pci_acp5x libarc4 r8169 snd_rn_pci_acp3x btmtk ucsi_acpi kvm videodev snd_hda_core
[15333.120717]  snd_acp_config realtek cfg80211 think_lmi sp5100_tco mdio_devres typec_ucsi hid_multitouch irqbypass snd_soc_acpi snd_hwdep videobuf2_common bluetooth psmouse rapl mc pcspkr typec wmi_bmof firmware_attributes_class ecdh_generic i2c_piix4 snd_pcm k10temp snd_pci_acp3x libphy snd_timer mhi roles mousedev joydev i2c_hid_acpi i2c_hid amd_pmc acpi_cpufreq acpi_tad mac_hid crypto_user fuse loop bpf_preload ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 hid_lg_g15 usbhid dm_crypt cbc encrypted_keys trusted asn1_encoder tee dm_mod serio_raw atkbd thinkpad_acpi libps2 crct10dif_pclmul crc32_pclmul crc32c_intel polyval_clmulni vivaldi_fmap polyval_generic gf128mul ghash_clmulni_intel ledtrig_audio sha512_ssse3 platform_profile aesni_intel nvme crypto_simd cryptd snd nvme_core xhci_pci ccp xhci_pci_renesas soundcore i8042 nvme_common serio rfkill amdgpu drm_ttm_helper ttm video wmi drm_buddy gpu_sched drm_display_helper cec
[15333.120731] CPU: 15 PID: 35582 Comm: Xorg Tainted: G        W          6.2.5-arch1-1 #1 fcf70e9d97e045884ea945a3d5b5ff73b06f7a27
[15333.120732] Hardware name: LENOVO 21J5002FGE/21J5002FGE, BIOS R23ET60W (1.30 ) 09/14/2022
[15333.120732] RIP: 0010:amdgpu_dm_atomic_commit_tail+0x2c8b/0x2cf0 [amdgpu]
[15333.120903] Code: c4 18 83 bd 10 fd ff ff 02 77 26 c7 85 10 fd ff ff 02 00 00 00 e9 f4 fd ff ff 0f 0b 0f 0b e9 d0 f5 ff ff 0f 0b e9 5f f5 ff ff <0f> 0b e9 e1 f5 ff ff 89 9d 08 fd ff ff 41 ba 02 00 00 00 8b 9d 10
[15333.120903] RSP: 0018:ffffb7bccb707568 EFLAGS: 00010082
[15333.120904] RAX: 0000000000000001 RBX: 0000000000000286 RCX: 0000000000000020
[15333.120904] RDX: 0000000000000001 RSI: 0000000000000297 RDI: ffffa0dccdf80178
[15333.120905] RBP: ffffb7bccb7078c8 R08: ffffb7bccb707494 R09: 0000000000000002
[15333.120905] R10: 0000000000000001 R11: 0000000000000000 R12: ffffa0dccdd9a118
[15333.120906] R13: 0000000000000000 R14: ffffa0ddc29e6c00 R15: ffffa0dccdd9a000
[15333.120906] FS:  00007fb6ca57f400(0000) GS:ffffa0e3221c0000(0000) knlGS:0000000000000000
[15333.120907] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[15333.120907] CR2: 0000564a85ad4008 CR3: 00000001d4c0e000 CR4: 0000000000750ee0
[15333.120908] PKRU: 55555554
[15333.120908] Call Trace:
[15333.120908]  <TASK>
[15333.120912]  commit_tail+0x94/0x130
[15333.120913]  drm_atomic_helper_commit+0x116/0x140
[15333.120914]  drm_atomic_commit+0x9a/0xd0
[15333.120915]  ? __pfx___drm_printfn_info+0x10/0x10
[15333.120916]  drm_client_modeset_commit_atomic+0x206/0x250
[15333.120918]  drm_client_modeset_commit_locked+0x5a/0x160
[15333.120919]  ? sched_clock_cpu+0x5d/0xb0
[15333.120920]  drm_fb_helper_set_par+0x7f/0x100
[15333.120921]  fb_set_var+0x204/0x420
[15333.120922]  ? __wake_up_common+0x76/0x180
[15333.120923]  ? __wake_up_common_lock+0x8f/0xd0
[15333.120924]  fbcon_blank+0x213/0x310
[15333.120926]  do_unblank_screen+0xac/0x160
[15333.120927]  vt_ioctl+0xb22/0x13c0
[15333.120928]  ? fsnotify+0x616/0x840
[15333.120930]  ? fsnotify+0x51e/0x840
[15333.120931]  tty_ioctl+0x292/0x8b0
[15333.120932]  ? dput+0x3a/0x310
[15333.120932]  ? __slab_free+0xe0/0x310
[15333.120934]  ? __slab_free+0xe0/0x310
[15333.120935]  __x64_sys_ioctl+0x94/0xd0
[15333.120936]  do_syscall_64+0x5f/0x90
[15333.120937]  ? exit_to_user_mode_prepare+0x145/0x1d0
[15333.120938]  ? syscall_exit_to_user_mode+0x1b/0x40
[15333.120939]  ? do_syscall_64+0x6b/0x90
[15333.120940]  ? do_syscall_64+0x6b/0x90
[15333.120941]  ? do_syscall_64+0x6b/0x90
[15333.120943]  entry_SYSCALL_64_after_hwframe+0x72/0xdc
[15333.120944] RIP: 0033:0x7fb6caf5053f
[15333.120946] Code: 00 48 89 44 24 18 31 c0 48 8d 44 24 60 c7 04 24 10 00 00 00 48 89 44 24 08 48 8d 44 24 20 48 89 44 24 10 b8 10 00 00 00 0f 05 <89> c2 3d 00 f0 ff ff 77 18 48 8b 44 24 18 64 48 2b 04 25 28 00 00
[15333.120947] RSP: 002b:00007ffd2b9d4f90 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
[15333.120947] RAX: ffffffffffffffda RBX: 0000564a83897ee0 RCX: 00007fb6caf5053f
[15333.120948] RDX: 0000000000000000 RSI: 0000000000004b3a RDI: 000000000000000f
[15333.120948] RBP: 0000000000000001 R08: 0000564a858b1b60 R09: 0000000000000000
[15333.120948] R10: 00007fb6cb2a5c50 R11: 0000000000000246 R12: 0000000000000000
[15333.120949] R13: 00000000ffffffff R14: 0000564a8389cdf0 R15: 0000000000000000
[15333.120950]  </TASK>
[15333.120950] ---[ end trace 0000000000000000 ]---
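
When reading a trace like this, note that frames prefixed with `?` are ones the kernel's unwinder could not verify as part of the call chain, so the unmarked frames are the more trustworthy path. A rough sketch for filtering them out follows; the regular expression and the name `reliable_frames` are assumptions based on the line format in this trace, and the sample is a few lines copied from it.

```python
import re

# Assumed frame shape: "[timestamp]  [? ]symbol+0xOFFSET/0xSIZE"
FRAME = re.compile(
    r"^\[\s*\d+\.\d+\]\s+(?P<unreliable>\?\s+)?(?P<sym>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"
)

# A few lines copied from the trace above.
SAMPLE = """\
[15333.120908] Call Trace:
[15333.120912]  commit_tail+0x94/0x130
[15333.120915]  ? __pfx___drm_printfn_info+0x10/0x10
[15333.120916]  drm_client_modeset_commit_atomic+0x206/0x250
[15333.120943]  entry_SYSCALL_64_after_hwframe+0x72/0xdc
"""


def reliable_frames(trace_text):
    """Return symbol names of frames that are not marked '?' (unverified)."""
    frames = []
    for line in trace_text.splitlines():
        match = FRAME.match(line.strip())
        if match and not match.group("unreliable"):
            frames.append(match.group("sym"))
    return frames


if __name__ == "__main__":
    print("\n".join(reliable_frames(SAMPLE)))
```

Applied to the full trace, this leaves the path from the `ioctl` syscall through `vt_ioctl`, `do_unblank_screen` and `fbcon_blank` down to `commit_tail`.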

I am not sure whether it is an issue with Slack, zypak or amdgpu/DRM.

nerijus commented 1 year ago

The last one: amdgpu/DRM. Please check for an existing report, or file a new one, here: https://gitlab.freedesktop.org/drm/amd/-/issues

enote-kane commented 1 year ago

@nerijus Thanks for the pointer. I think my issue has already been reported, and it also seems to occur with other applications.

Reference: https://gitlab.freedesktop.org/drm/amd/-/issues/2443

I'm closing this one and will instead watch the upstream bug.