4paradigm / k8s-vgpu-scheduler

The OpenAIOS vGPU device plugin for Kubernetes originated from the OpenAIOS project. It virtualizes GPU device memory so that applications can access a larger memory space than the GPU's physical capacity, and it is designed to make extended device memory easy to use for AI workloads.
Apache License 2.0
489 stars, 93 forks

How to monitor GPU usage in Prometheus #40

Open efeng-blue opened 3 months ago

efeng-blue commented 3 months ago

This includes per-pod GPU utilization and GPU memory usage. Is there any documentation describing the relevant metrics?