Open berkaroad opened 7 months ago
In getProcessInfo (pkg/dcgm/process_info.go), every ProcessInfo entry takes its GPU ID from the summary, so all entries report the same GPU:
    pInfo := ProcessInfo{
        GPU:                uint(pidInfo.summary.gpuId), // will always use same gpu
        PID:                uint(pidInfo.pid),
        Name:               name,
        ProcessUtilization: processUtil,
        PCI:                pci,
        Memory:             memory,
        GpuUtilization:     gpuUtil,
        Clocks:             clocks,
        Violations:         violations,
        XIDErrors:          xidErrs,
    }
Changing uint(pidInfo.summary.gpuId) to uint(pidInfo.gpus[i].gpuId), where i is the index of the GPU being processed, will fix it.
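To illustrate the difference, here is a minimal sketch that runs without cgo or DCGM. The gpuStats and pidInfo types and both helper functions are hypothetical stand-ins for the cgo-backed struct used in getProcessInfo, not the real types, and the sample values (including a summary gpuId of 0) are assumptions picked to match the two-GPU case described in this issue.

```go
package main

import "fmt"

// gpuStats and pidInfo are hypothetical stand-ins for the cgo-backed
// struct read in getProcessInfo; only the fields relevant here are modeled.
type gpuStats struct {
	gpuId uint32
}

type pidInfo struct {
	pid     uint32
	numGpus int
	summary gpuStats   // aggregate entry across all GPUs
	gpus    []gpuStats // one entry per GPU the process ran on
}

// gpuIDsFromSummary mirrors the reported behavior: each per-GPU entry
// reads the summary's gpuId, so every entry gets the same value.
func gpuIDsFromSummary(p pidInfo) []uint {
	ids := make([]uint, 0, p.numGpus)
	for i := 0; i < p.numGpus; i++ {
		ids = append(ids, uint(p.summary.gpuId))
	}
	return ids
}

// gpuIDsPerGPU applies the proposed change: each entry reads the gpuId
// of its own element in gpus.
func gpuIDsPerGPU(p pidInfo) []uint {
	ids := make([]uint, 0, p.numGpus)
	for i := 0; i < p.numGpus; i++ {
		ids = append(ids, uint(p.gpus[i].gpuId))
	}
	return ids
}

func main() {
	// Assumed sample data: one process that ran on GPU 0 and GPU 1.
	p := pidInfo{
		pid:     203639,
		numGpus: 2,
		summary: gpuStats{gpuId: 0},
		gpus:    []gpuStats{{gpuId: 0}, {gpuId: 1}},
	}
	fmt.Println("summary:", gpuIDsFromSummary(p)) // summary: [0 0]
	fmt.Println("per-GPU:", gpuIDsPerGPU(p))      // per-GPU: [0 1]
}
```

With the summary field, both entries come out as GPU 0; indexing into gpus gives 0 and 1, which matches what nvidia-smi shows.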
To reproduce: run benchmarks on 2 GPUs, then compare the output of ./processInfo -pid 203639 with nvidia-smi.
The 'GPU ID' reported by ./processInfo -pid 203639 is GPU-0, GPU-0, but nvidia-smi shows GPU-0, GPU-1.