NVIDIA / go-dcgm

Golang bindings for Nvidia Datacenter GPU Manager (DCGM)
Apache License 2.0

Where can I get the sample output of various policy failures? #43

Closed vinayburugu closed 1 year ago

vinayburugu commented 1 year ago

I registered the policies and want to handle policy failures using the example code below. Where can I find sample output for each type of policy failure? https://github.com/NVIDIA/go-dcgm/blob/main/samples/policy/main.go

glowkey commented 1 year ago

There are no specific examples, but the existing policy tests and the DCGM error injection documentation should help with your development:

https://github.com/NVIDIA/DCGM/blob/master/testing/python3/tests/test_policy.py
https://docs.nvidia.com/datacenter/dcgm/latest/user-guide/dcgm-error-injection.html
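
Since no sample output is published, one way to develop the handler before injecting real errors is to feed it simulated events. The following is a minimal sketch, not the go-dcgm API: the `violation` type, its fields, and the condition labels are hypothetical stand-ins, and the sketch assumes that policy failures arrive one at a time on a channel. Real field names and values should be taken from the bindings and from what error injection actually produces.

```go
package main

import (
	"context"
	"fmt"
	"time"
)

// violation is a HYPOTHETICAL stand-in for a policy failure event.
// It is not the type used by go-dcgm; the fields are assumptions.
type violation struct {
	Condition string      // which policy fired (labels below are made up)
	Timestamp time.Time   // when the failure was reported
	Data      interface{} // condition-specific payload
}

// describe turns one event into a log line, branching on the condition.
func describe(v violation) string {
	ts := v.Timestamp.UTC().Format(time.RFC3339)
	switch v.Condition {
	case "xid":
		return fmt.Sprintf("[%s] XID error: %v", ts, v.Data)
	case "dbe":
		return fmt.Sprintf("[%s] double-bit ECC error: %v", ts, v.Data)
	default:
		return fmt.Sprintf("[%s] unhandled condition %q: %v", ts, v.Condition, v.Data)
	}
}

// handle reads events until the channel is closed or ctx is cancelled
// and returns the log lines it produced.
func handle(ctx context.Context, events <-chan violation) []string {
	var lines []string
	for {
		select {
		case <-ctx.Done():
			return lines
		case v, ok := <-events:
			if !ok {
				return lines
			}
			lines = append(lines, describe(v))
		}
	}
}

// simulate feeds two made-up events through the handler, standing in
// for what error injection would trigger on a real system.
func simulate() []string {
	ts := time.Date(2024, 1, 2, 3, 4, 5, 0, time.UTC)
	events := make(chan violation, 2)
	events <- violation{Condition: "xid", Timestamp: ts, Data: 79}
	events <- violation{Condition: "thermal", Timestamp: ts, Data: 95}
	close(events)
	return handle(context.Background(), events)
}

func main() {
	for _, line := range simulate() {
		fmt.Println(line)
	}
}
```

Keeping the formatting logic in a function that takes a plain value makes it testable without a GPU. Once error injection is set up, the simulated channel can be swapped for the one the linked sample receives from.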