Open GKG1312 opened 2 months ago
hi @GKG1312 For release 2.4.0-2.4.1 please use one of the following CUDA versions: 11.8, 12.1, 12.4
hi @GKG1312 For release 2.4.0-2.4.1 please use one of the following CUDA versions: 11.8, 12.1, 12.4
Hi @atalman, just tried with versions mentioned below, but it did not solve anything. The output is same as previous. I am running this with NVIDIA Geforce RTX 4070 laptop GPU (8GB).
python=3.12.4
pytorch=2.4.0
pytorch_cuda=12.1
torch-tb-profiler=0.4.3
cuda-version=12.1
tensorboard=2.17.1
One thing I noticed now is that in memory view I can see GPU0
as device, but in overview section it is not showing such.
Sorry, if I am being naive here.
I tried to run the same in google colab and there I can see GPU summary in overview section.
I am trying to run pytorch profiler with tensorboard tutorial from pytorch/tutorial in Windows 11 in a conda environment and following version
The code executes with only a single warning message
[W904 11:50:36.000000000 CPUAllocator.cpp:249] Memory block of unknown size was allocated before the profiling started, profiler results will not include the deallocation event
. However, the tensorboard shows only CPU as device and dataloader time as 0.I am not able to figure out if it is a bug or because of version mismatch. Simplified code to replicate error:
cc @aaronenyeshi @chaekit @jcarreiro