NVIDIA / open-gpu-kernel-modules

NVIDIA Linux open GPU kernel module source
Other
14.17k stars 1.17k forks source link

NO SHARED MEMORY FOR YEARS [NVIDIA_UVM] - BASIC FEATURE #663

Open bioluks opened 2 weeks ago

bioluks commented 2 weeks ago

NVIDIA Open GPU Kernel Modules Version

550.90.07 (latest)

Please confirm this issue does not happen with the proprietary driver (of the same version). This issue tracker is only for bugs specific to the open kernel driver.

Operating System and Version

Multiple Setups (10+), for now on Arch

Kernel Release

multiple ones, right now on "6.9.3-hardened1-1-hardened"

Please confirm you are running a stable release kernel (e.g. not a -rc). We do not accept bug reports for unreleased kernels.

Hardware: GPU

NVIDIA GeForce GTX 1050 Ti

Describe the bug

This is ignored everywhere by NVIDIA employees and devs. Since 2016 we have no solution (I'm sure it was like this even before 2016). Do we need viral tweets and Reddit posts here and there bashing the company so they listen to us at all?

NVIDIA_UVM is not working even when loaded, checked via lsmod, also on a 30 series RTX. "nvidia-modprobe" does nothing. There is no dmesg to show since everything loads successfully. If the VRAM is full there is no backup option (no shared RAM like in Windows systems). We have high end graphics cards with very low VRAM, and it's slowly starting to become a fact they were produced this way on purpose.

I'm obviously annoyed. It's 2024. All other known GPU brands (AMD , Intel) don't have this issue; shared memory works just fine. It's a basic feature that should just work, just like in Windows. The NVIDIA driver still has the most annoying issues on Linux, we know you don't care about Linux users. Wayland issues, late incoming optimus support on laptops etc, you name it. If you hate open source this much don't publish the driver at all and stop further updates. From now on I will vote with my wallet (I know this won't change anything), the internet is begging you for bug fixes and you not caring just shows how you all think we have no alternative out there. For anyone here looking for fixes (there are none at the moment) check out:

  1. NVIDIA Forum Post from 2016 about this very issue
  2. Same issue on a 2023 NVIDIA Forum post with details
  3. Someone also raised this issue in the Discussions, but again. Dead silence.

No error logs are needed at this point, it's known shared memory (nvidia_uvm - unified shared memory) simply does not work.

If you don't want to buy an expensive GPU from NVIDIA, your only bet is to use Windows so your Games/Apps do not crash twhen your VRAM is full. The nvidia_uvm you see in lsmod acts like a placeholder for an empty file. Buy an AMD or Intel GPU for now. Like Linus said this is the worst company they had to deal with.

So the question is when this advertised as working feature of yours will start to work at all?

To Reproduce

Just install the latest proprietary driver and for once test the driver yourself as a dev. NVIDIA_UVM does not work, and if it works you used hidden parameters not known to us. Like mentioned below the nvidia-bug-report.sh script does not work, no matter which parameter passed.

Bug Incidence

Always

nvidia-bug-report.log.gz

nvidia-bug-report.sh is not working no matter what I do, tried the safe mode parameter, reboot etc. Of course ran as root. You have bigger issues if this is even hanging. Since I do not know if a bot/AI manages these issues I will upload an empty log.gz file. nvidia-bug-report.log.gz

More Info

You know the problem better than me. Please check the links I posted. Important forum posts like these should at least get an answer.

cngkyt commented 4 days ago

I bought multiple nvidia cards for a business and they are in rubbish bin now. I have to use windows or i have to use AMD or INTEL cards instead of this rubbish cards They dont have this feature and they wont in a short time. They dont care non profit developments DONT BUY NVIDIA