Open FlyGoat opened 10 months ago
Can confirm the issue on NixOS as well. Has been happening for earlier kernel versions on my system as well but currently it is Nvidia: Hardware: GeForce 3080Ti (mobile) Version: 545.29.06 Kernel: 6.6.8
In my case most of the time this happens when waking up again from suspend or the machine sitting around unused for some time
Same here, just hangup after updating to 545 today
After showing the above error message, I found a line at the very end hdaudio hdaudioCOD2:unable to configure disabling
. After searching online, I tried the fix as below, and able to log back into ubuntu normally.
# use recovery mode and enter root shell
# nano /etc/default/grub and make below change
# original
"GRUB_LINUX_DEFAULT="quiet splash nomodeset"
# to
"GRUB_LINUX_DEFAULT="quiet splash"
# then `update-grub` and `reboot`
Nvidia folks, any chance we can get this fixed in next release?
This bug is not fun! At a minimum its an indefinite delay and at worst its a crash on shutdown with no ability to open a tty
and save the day.
This gets spammed in dmesg
:
NVRM rmapiAllocWithSecInfo: RMAPI_GPU_LOCK_INTERNAL alloc requested without holding the RMAPI lock
I also get this spam in dmesg
too:
[ 122.596895] NVRM nbsiReadRegistryDword: osReadRegistryDword called in Sleep path can cause excessive delays!
[ 122.596903] NVRM nvAssertFailedNoLog: Assertion failed: 0 @ nbsi_osrg.c:107
$ cat /proc/driver/nvidia/version
outputs:
NVRM version: NVIDIA UNIX Open Kernel Module for x86_64 545.23.08 Release Build (dvs-builder@U16-I3-A16-1-1)
Hi, thanks for the report! This is tracked internally as bug 4074148. Hard to say which release will get the fix due to schedules and release branching.
For whatever it's worth, the root cause of the print from that particular call stack (rm_power_management()
) was found to be "harmless", except for the print spam. Any other issues you are seeing are likely to be independent and deserve a separate bug report.
Still happening with 550.78.
[ 15.007542] NVRM: loading NVIDIA UNIX Open Kernel Module for x86_64 550.78 Release Build (dvs-builder@U16-I1-N08-06-4) Sun Apr 14 06:38:24 UTC 2024
[ 15.302560] NVRM: testIfDsmSubFunctionEnabled: GPS ACPI DSM called before _acpiDsmSupportedFuncCacheInit subfunction = 11.
[ 36.803975] NVRM: rmapiAllocWithSecInfo: RMAPI_GPU_LOCK_INTERNAL alloc requested without holding the RMAPI lock
NVIDIA Open GPU Kernel Modules Version
545.29.06
Please confirm this issue does not happen with the proprietary driver (of the same version). This issue tracker is only for bugs specific to the open kernel driver.
Operating System and Version
Arch Linux
Kernel Release
6.6.7
Please confirm you are running a stable release kernel (e.g. not a -rc). We do not accept bug reports for unreleased kernels.
Hardware: GPU
NVIDIA GeForce RTX 4060 (10de:28e0)
Describe the bug
dmesg spam with:
To Reproduce
Boot on such system, and check dmesg.
Bug Incidence
Always
nvidia-bug-report.log.gz
nvidia-bug-report.log.gz
More Info
I tried to debug the issue by appending a
os_stack_trace()
after where the message is printed, I got the following backtrace:Backtraces are almost the same each time it printed.