NVIDIA / k8s-device-plugin

NVIDIA device plugin for Kubernetes
Apache License 2.0
2.68k stars 606 forks source link

Using CUDA MPS to enable GPU sharing in K8S, error:error checking MPS daemon health #634

Closed zhangQiWorr closed 4 weeks ago

zhangQiWorr commented 5 months ago

image

image

image

zhangQiWorr commented 5 months ago

Should I start MPS daemon(nvidia-cuda-mps-control) in the k8s node?

elezar commented 5 months ago

@zhangQiWorr are you deploying the device plugin using helm? The use of MPS in the device plugin requires both GFD and an additional component to manage the lifecycle of the MPS control daemon. Here the use of helm is recommended since this deploys the relevant daemonsets.

github-actions[bot] commented 1 month ago

This issue is stale because it has been open 90 days with no activity. This issue will be closed in 30 days unless new comments are made or the stale label is removed.

github-actions[bot] commented 4 weeks ago

This issue was automatically closed due to inactivity.