Closed roryck closed 2 years ago
Yeah, the nvhpc install doesn't put them in there, but instead puts them into:
/glade/u/apps/common/22.08/spack/opt/spack/nvhpc/22.7/gcc/7.5.0/Linux_x86_64/22.7/math_libs/11.7/targets/x86_64-linux/lib/libcublas.so.11
You should see the same for newer CUDAs on Casper too (like 11.6
)... but I'll double-check.
I think on Casper I add the math_libs stuff to the NCAR wrapper paths and the LD_LIBRARY_PATH in the CUDA module - does that work for your purposes?
Yes, if it gets added to the LD_LIBRARY_PATH in the CUDA module I believe it will get picked up by tensorflow, which is where this is failing for me.
We will do dedicated CUDA installs via Spack to resolve this issue from now on (rather than pulling the semi-broken CUDA from the nvhpc install).
On casper the base CUDA lib dir
contains some numerical libraries that are missing from what appears to be the equivalent directory on Gust
The missing libs that I've hit so far are: