kusayuzayushko closed this 5 months ago
The "unknown error" message comes from cuda and is usually an indication of mismatched libraries/headers. However, we have a check for that which prints a warning at an earlier point, so something strange is going on here. The other strange thing is that `cudnn-auto` should switch to `cuda-fp16` on the RTX4090, unless there is a second nvidia gpu on this system. This may also be a symptom of the previous problem.
Can you verify which cuda libraries are used (e.g. with `ldd` or similar) and that `/opt/cuda/bin/nvcc` is for cuda 12.2?
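A minimal sketch of that check in Python, in case it helps. It assumes `ldd` and `nvcc` are available on the system and that the binary lives at `./build/release/lc0` (a hypothetical path, adjust to your build). The helper names `cuda_libs_from_ldd`, `nvcc_release` and `report` are made up for illustration and are not part of lc0.

```python
import re
import subprocess

# Library name prefixes treated as CUDA-related (an assumption, extend as needed).
CUDA_LIB_PATTERN = re.compile(r"lib(cudart|cublas|cublasLt|cudnn|nvrtc|cuda)\b")


def cuda_libs_from_ldd(ldd_output):
    """Map each CUDA-related library in ldd output to its resolved path (None if unresolved)."""
    libs = {}
    for line in ldd_output.splitlines():
        line = line.strip()
        if not line:
            continue
        name = line.split()[0]
        if not CUDA_LIB_PATTERN.search(name):
            continue
        path = None
        if "=>" in line:
            target = line.split("=>", 1)[1].strip()
            if not target.startswith("not found"):
                # Drop the trailing load address, e.g. " (0x00007f...)".
                path = target.split(" (")[0]
        libs[name] = path
    return libs


def nvcc_release(version_output):
    """Extract the 'release X.Y' number from `nvcc --version` output, or None."""
    match = re.search(r"release (\d+\.\d+)", version_output)
    return match.group(1) if match else None


def report(lc0_path="./build/release/lc0", nvcc_path="/opt/cuda/bin/nvcc"):
    """Print the CUDA libraries lc0 links against and the nvcc release."""
    ldd = subprocess.run(["ldd", lc0_path], capture_output=True, text=True)
    for name, path in cuda_libs_from_ldd(ldd.stdout).items():
        print(f"{name} -> {path or 'NOT FOUND'}")
    nvcc = subprocess.run([nvcc_path, "--version"], capture_output=True, text=True)
    print("nvcc release:", nvcc_release(nvcc.stdout))
```

Calling `report()` prints one line per CUDA library plus the nvcc release. A library that resolves outside `/opt/cuda`, or a release other than 12.2, would point to the kind of mismatch described above.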
Also note that for RTX4090 the cuda-auto backend is usually better than the cudnn-auto.
When running lc0, try adding a backend parameter: `--backend=cuda-auto`. If that doesn't work, try `--backend=cuda-fp16`. As borg323 mentioned, this backend is appropriate for the RTX4090.
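If you want to try the two backends in order without retyping the command, something like the following Python sketch could work. It assumes lc0's `benchmark` mode accepts `--backend` and that a failed CUDA initialisation gives a non-zero exit code, neither of which has been verified in this thread. The helper names are hypothetical.

```python
import subprocess

# Backends suggested in this thread, in the order they should be tried.
BACKENDS = ["cuda-auto", "cuda-fp16"]


def build_command(lc0_path, backend):
    """Build the lc0 command line for one backend (benchmark mode is an assumption)."""
    return [lc0_path, "benchmark", f"--backend={backend}"]


def run_command(cmd):
    """Run a command and return its exit code."""
    return subprocess.run(cmd).returncode


def first_working_backend(lc0_path, backends=BACKENDS, run=run_command):
    """Return the first backend whose command exits with code 0, or None."""
    for backend in backends:
        if run(build_command(lc0_path, backend)) == 0:
            return backend
    return None
```

For example, `first_working_backend("./build/release/lc0")` (path is hypothetical) returns `"cuda-auto"` if that run succeeds, falls back to `"cuda-fp16"` otherwise, and returns `None` if both fail.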
Additionally, check out https://github.com/LeelaChessZero/lc0/discussions/1904
archlinux, after building from scratch, getting cuda error:
Hardware info: OS: Archlinux; GPU: NVIDIA GeForce RTX 4090; Driver Version: 535.104.05; lc0 git branch: release/0.30
Build info: