NVIDIA / DCGM

NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs
Apache License 2.0

Error setting watches. Result: -33: This request is serviced by a module of DCGM that is not currently loaded #47

Open mintchocohoco opened 2 years ago

mintchocohoco commented 2 years ago

Hello, I am using a P100 GPU, and the fields numbered above 1000 (1002, 1003, 1004, 1005, ...) do not work. Setting watches on them fails with this error:

Error setting watches. Result: -33: This request is serviced by a module of DCGM that is not currently loaded

nikkon-dev commented 2 years ago

@mintchocohoco,

The DCP metrics (1001...) are supported starting from the Turing architecture. Pascal, which includes the P100, is not supported. For some metrics (1013, 1014) you would need at least an Ampere GA100 chip.
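
As a rough illustration of the rules in this reply, the sketch below predicts which requested field IDs would be rejected on a given GPU. It is not part of DCGM and calls no DCGM API: the names `ARCH_RANK`, `min_arch_for_field`, and `unsupported_fields` are hypothetical, the three architecture labels are just the ones mentioned in this thread, and treating every ID from 1001 upward as a DCP metric is an assumption taken from the "(1001...)" wording above.

```python
# Hypothetical helper that encodes only the rules stated in this thread.
# It does not query DCGM; it is a lookup sketch under stated assumptions.

# Ordering of the architectures named in the reply (assumed labels).
ARCH_RANK = {"pascal": 0, "turing": 1, "ga100": 2}

# Assumption: every field ID from 1001 upward is a DCP metric.
DCP_FIRST_FIELD_ID = 1001

# Per the reply, these two metrics need at least an Ampere GA100 chip.
GA100_ONLY_FIELD_IDS = {1013, 1014}


def min_arch_for_field(field_id):
    """Return the minimum architecture label for a field, or None if the
    field is not a DCP metric under the assumptions above."""
    if field_id in GA100_ONLY_FIELD_IDS:
        return "ga100"
    if field_id >= DCP_FIRST_FIELD_ID:
        return "turing"
    return None


def unsupported_fields(arch, field_ids):
    """Return the field IDs from field_ids that the given architecture
    cannot serve, preserving the input order."""
    rank = ARCH_RANK[arch.lower()]  # raises KeyError for unknown labels
    rejected = []
    for field_id in field_ids:
        required = min_arch_for_field(field_id)
        if required is not None and rank < ARCH_RANK[required]:
            rejected.append(field_id)
    return rejected


if __name__ == "__main__":
    # The IDs from the original report, plus one ID below 1001 for contrast.
    print(unsupported_fields("pascal", [150, 1002, 1003, 1004, 1005]))
```

Run with the P100 case from the report, every ID from 1002 to 1005 is flagged while the ID below 1001 is not, which matches the behaviour described above.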