NVIDIA / DCGM

NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs
Apache License 2.0
387 stars 50 forks source link

Running diagnostics causes the Memory Usage of the other GPUs to increase #167

Open BetaZYN opened 4 months ago

BetaZYN commented 4 months ago

When I run a long diagnostic on one GPU, the Memory Usage of the other GPUs goes from 0 to 3M, and then back to 0 when the diagnostic is finished. Is this normal? GPU model is L40S I look forward to your reply