Open mmstick opened 1 month ago
We'll also need to update https://github.com/pop-os/egl-wayland to make use of explicit sync for Wayland EGL, presumably. I assume Nvidia will tag a release of egl-wayland when the stable 555 driver is released.
I don't seem to see Nvidia's driver initializing an EGL display correctly for X11 or Wayland. Might be something with my Pop install, since I have done some testing of different driver configurations here.
Is it working for others?
egl-wayland updates: https://github.com/pop-os/egl-wayland/pull/3 They haven't tagged a new release yet, but there is a patch for explicit sync.
Looks like I was missing __NV_PRIME_RENDER_OFFLOAD=1. Without that the Nvidia Wayland EGL backend won't initialize. (So it falls back to Mesa, or fails if __EGL_VENDOR_LIBRARY_FILENAMES=/usr/share/glvnd/egl_vendor.d/10_nvidia.json is set.)
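For anyone else testing this, here is a minimal sketch of how the two variables above could be set for a single test process rather than the whole session. The helper name nvidia_egl_env and its force_nvidia_vendor parameter are made up for illustration; only the variable names and the JSON path come from this thread.

```python
import os

# Path quoted earlier in this thread; it may differ on other installs.
NVIDIA_EGL_VENDOR_JSON = "/usr/share/glvnd/egl_vendor.d/10_nvidia.json"


def nvidia_egl_env(base_env=None, force_nvidia_vendor=False):
    """Return a copy of an environment with the Nvidia EGL variables set.

    Hypothetical helper: it only builds the environment mapping and does
    not check that the driver or the vendor JSON file is actually present.
    """
    env = dict(os.environ if base_env is None else base_env)
    # Reported above as required for the Nvidia Wayland EGL backend to initialize.
    env["__NV_PRIME_RENDER_OFFLOAD"] = "1"
    if force_nvidia_vendor:
        # Restrict the vendor list to Nvidia, so there is no fallback to Mesa
        # and a broken setup fails outright.
        env["__EGL_VENDOR_LIBRARY_FILENAMES"] = NVIDIA_EGL_VENDOR_JSON
    return env
```

The result can be passed as the env argument of subprocess.run when launching a test client (for example eglinfo, assuming it is installed). Setting force_nvidia_vendor=True makes a failed initialization show up as an error instead of a silent Mesa fallback.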
Maybe Nvidia doesn't support EGL for X11 (instead of GLX)...
I'm not positive, as I cannot recreate it, but I did see an issue where I could "scroll" an external display. When I reverted the driver to the current release, I no longer had the problem. When I re-added 555, I couldn't cause it again.
To describe the video (In case it's not fully clear):
I'm currently trying to fully eliminate any other packages.
Everything seems to be checking out with this driver so far, besides one potential regression. With a 10-series desktop card, some machines have trouble resuming from suspend after being left in suspend for a while. We recreated this in this 555 version, and we are currently trying to recreate it in the released 550 driver to determine whether it's a regression or not.
It looks like the failure to resume for desktops with 10 series cards is, in fact, a regression.
I've pulled (and truncated) logs from the resume process. There's a kernel bug that crops up. 10xx_resume_555_fail.log The kernel bug specifically calls out nvidia as part of its output.
Right now, I'm checking against the new (6.9.3) kernel PR, in case there is some mismatch between this PR and our current shipped kernel.
Well, it looks like, while 6.9.3 does sometimes allow the system to resume, the issue of a non-responsive system on resume is still present. We'll either need to dig down ourselves, for which some guidance would be helpful, or we'll need to wait.
My instinct is that this isn't fully a driver issue. It might be a gnome/configuration issue.
I think we should try a newer Nvidia beta driver for 555 or otherwise wait. The resume issues on the 10xx series GPUs are a blocker for releasing this, and there appear to be issues coming back from suspend regardless of DE. I can log in to Cosmic via cosmic-greeter reliably, but the compositor doesn't seem to be operating correctly, as I get issues like the following screenshot:
Using GDM, even with the newer kernel PR, the vast majority of the time I see either a black screen with a mouse cursor or the same non-responsive black screen as before.
https://www.nvidia.co.uk/download/driverResults.aspx/224793/en-us