NVIDIA / data-science-stack

NVIDIA Data Science stack tools
Apache License 2.0

Pytorch and Tensorflow no GPU but Nvidia SMI okay #126

Open Djdraper12 opened 2 years ago

Djdraper12 commented 2 years ago

`torch.cuda.is_available()` returns False, and TensorFlow does not detect the GPU either.

Nvidia SMI output:

+-----------------------------------------------------------------------------+
| NVIDIA-SMI 465.19.01    Driver Version: 465.19.01    CUDA Version: 11.3     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA GeForce ...  On   | 00000000:01:00.0  On |                  N/A |
|  0%   43C    P8    13W / 260W |   1067MiB /  7979MiB |     26%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A      2336      G   /usr/lib/xorg/Xorg                102MiB |
|    0   N/A  N/A      4822      G   /usr/lib/xorg/Xorg                333MiB |
|    0   N/A  N/A      5011      G   /usr/bin/gnome-shell               56MiB |
|    0   N/A  N/A     13415      G   ...AAAAAAAAA= --shared-files      558MiB |

There don't seem to be any issues with the CUDA install. Any ideas on how to get this working within the stack? The PyTorch version is 1.8; could this be the reason, and how can we upgrade individual packages within the stack?
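One way to narrow this down is to ask each framework whether its installed build includes CUDA support at all, since a CPU-only build reports no GPU regardless of what `nvidia-smi` shows. The sketch below is only a diagnostic aid and does not establish the cause of this issue. The helper name `gpu_report` is hypothetical, and the TensorFlow calls assume TensorFlow 2.1 or later.

```python
import importlib


def gpu_report():
    """Collect what PyTorch and TensorFlow report about CUDA support.

    Hypothetical helper for illustration; a framework that is missing or
    fails to import is recorded as a string instead of raising.
    """
    report = {}

    try:
        torch = importlib.import_module("torch")
    except Exception as exc:  # missing package or broken shared libraries
        report["torch"] = f"import failed: {exc}"
    else:
        report["torch"] = {
            "version": torch.__version__,
            # None here means the installed wheel is a CPU-only build.
            "built_with_cuda": torch.version.cuda,
            "cuda_available": torch.cuda.is_available(),
        }

    try:
        tf = importlib.import_module("tensorflow")
    except Exception as exc:
        report["tensorflow"] = f"import failed: {exc}"
    else:
        report["tensorflow"] = {
            "version": tf.__version__,
            "built_with_cuda": tf.test.is_built_with_cuda(),
            "gpus": [d.name for d in tf.config.list_physical_devices("GPU")],
        }

    return report


if __name__ == "__main__":
    for framework, info in gpu_report().items():
        print(framework, info)
```

If `built_with_cuda` comes back as `None` (PyTorch) or `False` (TensorFlow), the environment holds a CPU-only build, and the driver and `nvidia-smi` output are not the place to look.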

Djdraper12 commented 2 years ago

@bmwshop any luck on this? I got so fed up that I reinstalled Ubuntu. CUDA is now 11.4, PyTorch is 1.8, and it is still not working.

EkinsMatthew commented 1 year ago

I have the same problem as the author. `nvidia-smi` shows my GPU, but TensorFlow does not detect it.

`Wed Oct 26 03:27:27 2022
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 470.141.03   Driver Version: 470.141.03   CUDA Version: 11.4     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA GeForce ...  Off  | 00000000:01:00.0  On |                  N/A |
|  0%   41C    P8    14W / 400W |    166MiB / 12050MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A      1474      G   /usr/lib/xorg/Xorg                 53MiB |
|    0   N/A  N/A     61887      G   /usr/lib/xorg/Xorg                 85MiB |
|    0   N/A  N/A     62028      G   /usr/bin/gnome-shell               10MiB |
+-----------------------------------------------------------------------------+`
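The "CUDA Version" field in the `nvidia-smi` header is the highest CUDA runtime version the driver supports, not the version of any installed toolkit or the version a framework was built against. As a rough sketch, the header can be parsed and compared with a framework's build version. The function names here are hypothetical, and the comparison is deliberately conservative (CUDA minor-version compatibility can relax it in practice).

```python
import re


def parse_smi_header(text):
    """Return (driver_version, cuda_version) from nvidia-smi output, or None."""
    match = re.search(
        r"Driver Version:\s*([\d.]+)\s+CUDA Version:\s*([\d.]+)", text
    )
    return match.groups() if match else None


def version_tuple(version):
    """Turn '11.4' into (11, 4) so versions compare numerically."""
    return tuple(int(part) for part in version.split("."))


def within_driver_limit(build_cuda, driver_cuda):
    """Conservative check: the build's CUDA version does not exceed the driver's."""
    return version_tuple(build_cuda) <= version_tuple(driver_cuda)


# Header line taken from the output above.
header = "| NVIDIA-SMI 470.141.03 Driver Version: 470.141.03 CUDA Version: 11.4 |"
driver_version, driver_cuda = parse_smi_header(header)

if __name__ == "__main__":
    print("driver:", driver_version, "max CUDA:", driver_cuda)
```

The build version to compare against would come from the framework itself, for example `torch.version.cuda` in PyTorch.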

EkinsMatthew commented 1 year ago

@Djdraper12 did you have any luck fixing this?