Here's a trace with debug symbols:
PID: 65282 (sway)
Signal: 11 (SEGV)
Timestamp: Wed 2021-05-19 14:55:36 CEST (1min 31s ago)
Command Line: sway --my-next-gpu-wont-be-nvidia -V
Executable: /usr/bin/sway
Disk Size: 20.5M
Message: Process 65282 (sway) of user 1000 dumped core.
Stack trace of thread 65282:
#0 0x00007f065a39a158 wl_event_loop_dispatch (libwayland-server.so.0 + 0xb158)
#1 0x00007f065a397c37 wl_display_run (libwayland-server.so.0 + 0x8c37)
#2 0x0000563f609557f2 main (sway + 0x107f2)
#3 0x00007f065a0acb25 __libc_start_main (libc.so.6 + 0x27b25)
#4 0x0000563f60955b6e _start (sway + 0x10b6e)
Stack trace of thread 65285:
#0 0x00007f065a0778ca __futex_abstimed_wait_common64 (libpthread.so.0 + 0x158ca)
#1 0x00007f065a071270 pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0 + 0xf270)
#2 0x00007f0655783dc4 n/a (nouveau_dri.so + 0x78adc4)
#3 0x00007f065576b808 n/a (nouveau_dri.so + 0x772808)
#4 0x00007f065a06b259 start_thread (libpthread.so.0 + 0x9259)
#5 0x00007f065a1835e3 __clone (libc.so.6 + 0xfe5e3)
Stack trace of thread 65294:
#0 0x00007f065a0778ca __futex_abstimed_wait_common64 (libpthread.so.0 + 0x158ca)
#1 0x00007f065a071270 pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0 + 0xf270)
#2 0x00007f0655783dc4 n/a (nouveau_dri.so + 0x78adc4)
#3 0x00007f065576b808 n/a (nouveau_dri.so + 0x772808)
#4 0x00007f065a06b259 start_thread (libpthread.so.0 + 0x9259)
#5 0x00007f065a1835e3 __clone (libc.so.6 + 0xfe5e3)
Stack trace of thread 65287:
#0 0x00007f065a0778ca __futex_abstimed_wait_common64 (libpthread.so.0 + 0x158ca)
#1 0x00007f065a071270 pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0 + 0xf270)
#2 0x00007f0655783dc4 n/a (nouveau_dri.so + 0x78adc4)
#3 0x00007f065576b808 n/a (nouveau_dri.so + 0x772808)
#4 0x00007f065a06b259 start_thread (libpthread.so.0 + 0x9259)
#5 0x00007f065a1835e3 __clone (libc.so.6 + 0xfe5e3)
Stack trace of thread 65289:
#0 0x00007f065a0778ca __futex_abstimed_wait_common64 (libpthread.so.0 + 0x158ca)
#1 0x00007f065a071270 pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0 + 0xf270)
#2 0x00007f0655783dc4 n/a (nouveau_dri.so + 0x78adc4)
#3 0x00007f065576b808 n/a (nouveau_dri.so + 0x772808)
#4 0x00007f065a06b259 start_thread (libpthread.so.0 + 0x9259)
#5 0x00007f065a1835e3 __clone (libc.so.6 + 0xfe5e3)
Stack trace of thread 65298:
#0 0x00007f065a0778ca __futex_abstimed_wait_common64 (libpthread.so.0 + 0x158ca)
#1 0x00007f065a071270 pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0 + 0xf270)
#2 0x00007f0655783dc4 n/a (nouveau_dri.so + 0x78adc4)
#3 0x00007f065576b808 n/a (nouveau_dri.so + 0x772808)
#4 0x00007f065a06b259 start_thread (libpthread.so.0 + 0x9259)
#5 0x00007f065a1835e3 __clone (libc.so.6 + 0xfe5e3)
Stack trace of thread 65291:
#0 0x00007f065a0778ca __futex_abstimed_wait_common64 (libpthread.so.0 + 0x158ca)
#1 0x00007f065a071270 pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0 + 0xf270)
#2 0x00007f0655783dc4 n/a (nouveau_dri.so + 0x78adc4)
#3 0x00007f065576b808 n/a (nouveau_dri.so + 0x772808)
#4 0x00007f065a06b259 start_thread (libpthread.so.0 + 0x9259)
#5 0x00007f065a1835e3 __clone (libc.so.6 + 0xfe5e3)
Stack trace of thread 65301:
#0 0x00007f065a0778ca __futex_abstimed_wait_common64 (libpthread.so.0 + 0x158ca)
#1 0x00007f065a071270 pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0 + 0xf270)
#2 0x00007f065576e814 n/a (nouveau_dri.so + 0x775814)
#3 0x00007f065576b7d8 n/a (nouveau_dri.so + 0x7727d8)
#4 0x00007f065a06b259 start_thread (libpthread.so.0 + 0x9259)
#5 0x00007f065a1835e3 __clone (libc.so.6 + 0xfe5e3)
Stack trace of thread 65304:
#0 0x00007f065a0778ca __futex_abstimed_wait_common64 (libpthread.so.0 + 0x158ca)
#1 0x00007f065a071270 pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0 + 0xf270)
#2 0x00007f065576e814 n/a (nouveau_dri.so + 0x775814)
#3 0x00007f065576b7d8 n/a (nouveau_dri.so + 0x7727d8)
#4 0x00007f065a06b259 start_thread (libpthread.so.0 + 0x9259)
#5 0x00007f065a1835e3 __clone (libc.so.6 + 0xfe5e3)
Stack trace of thread 65303:
#0 0x00007f065a0778ca __futex_abstimed_wait_common64 (libpthread.so.0 + 0x158ca)
#1 0x00007f065a071270 pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0 + 0xf270)
#2 0x00007f065576e814 n/a (nouveau_dri.so + 0x775814)
#3 0x00007f065576b7d8 n/a (nouveau_dri.so + 0x7727d8)
#4 0x00007f065a06b259 start_thread (libpthread.s
I seem to be able to reproduce some kind of crash by opening a very large number of kitty windows quickly.
The logs seem to change each time it crashes:
After each of the logs there would also be the following:
Stack trace of thread 8877:
#0 0x00007f50aff5da3c recv (libc.so.6 + 0xffa3c)
#1 0x00007f50aeedf153 n/a (libEGL_nvidia.so.0 + 0x79153)
#2 0x00007f50aeedff1d n/a (libEGL_nvidia.so.0 + 0x79f1d)
#3 0x00007f50aeed507d n/a (libEGL_nvidia.so.0 + 0x6f07d)
#4 0x00007f50aef2271e n/a (libEGL_nvidia.so.0 + 0xbc71e)
#5 0x00007f50afe44259 start_thread (libpthread.so.0 + 0x9259)
#6 0x00007f50aff5c5e3 __clone (libc.so.6 + 0xfe5e3)
for every window of kitty opened.
Doing the same for Firefox or the foot terminal did not lead to this issue, so I think it has to do with kitty and the fact that it uses the GPU for rendering. Perhaps it's linked to this issue?
Looking at crashed thread 7154, it seems like recursive rendering in sway led to a stack overflow.
I couldn't reproduce on another machine running an AMD iGPU, so I think that it might be linked to the proprietary NVIDIA drivers. This issue also occurs when I open many instances of mpv with -vo=gpu --opengl-es=yes quickly.
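For anyone who wants to try reproducing this, here is a rough sketch of how the "open many GPU-rendered windows quickly" step could be scripted. This is not the script I actually used; the helper names `spawn_many` and `terminate_all`, the count, and the command-line handling are all made up for illustration, and the command to launch (for example kitty) is whatever you pass on the command line.

```python
import subprocess
import sys
import time


def spawn_many(command, count, delay=0.0):
    """Start `count` copies of `command` back to back and return the Popen handles.

    `command` is an argument list such as ["kitty"] (hypothetical example).
    `delay` is an optional pause in seconds between launches.
    """
    procs = []
    for _ in range(count):
        procs.append(subprocess.Popen(command))
        if delay:
            time.sleep(delay)
    return procs


def terminate_all(procs, timeout=5.0):
    """Ask every still-running process to exit, killing any that do not comply."""
    for proc in procs:
        if proc.poll() is None:
            proc.terminate()
    for proc in procs:
        try:
            proc.wait(timeout=timeout)
        except subprocess.TimeoutExpired:
            proc.kill()
            proc.wait()


# Hypothetical usage: python spawn_many.py 50 kitty
if __name__ == "__main__" and len(sys.argv) > 2:
    started = spawn_many(sys.argv[2:], int(sys.argv[1]))
    input(f"Started {len(started)} processes, press Enter to close them... ")
    terminate_all(started)
```

Run it from a terminal inside the sway session. If sway itself goes down, the spawned clients will most likely die with it, so the cleanup step only matters when the crash does not happen.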
Closing this. Reopen if issue is still here.
I have occasional segfaults in Sway:
The stack trace looks like this:
This library is owned by the wayland package (version 1.19.0-1 in my case); however, I assume that the root cause of the segfault is in wlroots. Would it help if I recompiled wayland with debug info?