leptonai / gpud

feat(nvidia): add bad-envs component for `DCGM_FR_BAD_CUDA_ENV` logic in DCGM #121

Closed gyuho closed 2 weeks ago

gyuho commented 2 weeks ago

Extend https://github.com/leptonai/gpud/pull/119.