Open radonnachie opened 4 years ago
Starting at v1.3.9 the libknet8.so library is downloaded as an Artifact and the CUDA toolkit is no longer needed (at least in theory). Could you check if v1.3.9 (latest released version) works?
The above was produced with v1.3.9 (I had used release before, but posted comes from the Knet master branch [see output of pkg> status), which did download libknet8.so.
I have everything working now with the following versions: (not working) -> (working)
Julia v1.5, CUDA#master, Knet#master, CUDA Toolkit v11.0.3, GeForce Driver v451.82, Windows 10. -> Julia v1.4, CUDA.jl#master v1.2.1, Knet v1.3.9, CUDA Toolkit 10.2.89 + CUDNN v8.0.2.39, GeForce Driver v451.82, Windows 10.
Sorry that I do not know what was wrong with using CUDA toolkit 11. I also am not in a position to risk the setup's integrity to test further. I'm even nervous to update Julia to v1.5...
As a notification though, Pkg.test("Knet") has a number of errors due to:
Got exception outside of a @test
could not load symbol "cudnnSetRNNDescriptor": The specified procedure could not be found.
This latest cudnnSetRNNDescriptor error is a new feature introduced by CUDNN v8. The next release of Knet (https://github.com/denizyuret/Knet.jl/pull/596) fixes this and hopefully makes the installation more robust.
Alright, at that point I will test the full setup again! (Julia v1.5, CUDA#master, Knet#master, CUDA Toolkit v11.0.3, GeForce Driver v451.82, Windows 10.
)
Let's try this with Knet v1.4.0, see if it is fixed.
With Julia v1.5, CUDA#master, Knet#master, CUDA Toolkit v11.0.3, GeForce Driver v451.82, Windows 10.
Before I detail the errors, I must say that I had faced the lib not defined before 1.3.6 was released and have read up on as many of @denizyuret's posts to similar (but now outdated) issue-posts.
I know that the
lib not found error
means that Knet was not able to locate the CUDA installation. I have the environment variableCUDA_PATH
set to the CUDA installation. This pleases CUDA, but they mention that the variable's name could also beCUDA_HOME
. I wonder if that is required for Knet.Or are the Knet tests outdated, and their failure non-consequential?
I have my Nvidia drivers updated (v451.36), and CUDA Toolkit V11.0.3 installed: