Project-HAMi / volcano-vgpu-device-plugin

Device-plugin for volcano vgpu which support hard resource isolation
Apache License 2.0
44 stars 14 forks source link

volcano.sh/vgpu-memory显示是0 #18

Closed bx0216 closed 2 months ago

bx0216 commented 2 months ago

我的环境是8块A100的gpu, 当使用volcano-vgpu-device-plugin-with-monitor.yml 或 volcano-vgpu-device-plugin.yml时 kubectl get node/gpu-node -o yaml显示的gpu显存为0,如下: 微信图片_20240826151900

可以参考这个问题: https://github.com/volcano-sh/devices/issues/19

MiterV1 commented 2 weeks ago

同样的问题,不知道是否已解决

Hugh-yw commented 1 day ago

@bx0216 你好,请问你有么有测试过 卸载volcano-vgpu-device-plugin后,但是集群中节点信息中还存留 volcano.sh/vgpu-number、volcano.sh/gpu-memory 资源标签

MiterV1 commented 21 hours ago

是的,是卸载的问题。 标签没有更新。

Hugh-yw commented 11 hours ago

是的,是卸载的问题。 标签没有更新。

还没解决是麽