OpenAIOS vGPU device plugin for Kubernetes is originated from the OpenAIOS project to virtualize GPU device memory, in order to allow applications to access larger memory space than its physical capacity. It is designed for ease of use of extended device memory for AI workloads.
The template below is mostly useful for bug reports and support questions. Feel free to remove anything which doesn't apply to you and add more information where it makes sense.
1. Issue or feature description
2. Steps to reproduce the issue
3. Information to attach (optional if deemed irrelevant)
Common error checking:
[ ] The output of nvidia-smi -a on your host
[ ] Your docker configuration file (e.g: /etc/docker/daemon.json)
[ ] The k8s-device-plugin container logs
[ ] The kubelet logs on the node (e.g: sudo journalctl -r -u kubelet)
Additional information that might help better understand your environment and reproduce the bug:
[ ] Docker version from docker version
[ ] Docker command, image and tag used
[ ] Kernel version from uname -a
[ ] Any relevant kernel output lines from dmesg
[ ] NVIDIA packages version from dpkg -l '*nvidia*'orrpm -qa '*nvidia*'
[ ] NVIDIA container library version from nvidia-container-cli -V
The template below is mostly useful for bug reports and support questions. Feel free to remove anything which doesn't apply to you and add more information where it makes sense.
1. Issue or feature description
2. Steps to reproduce the issue
3. Information to attach (optional if deemed irrelevant)
Common error checking:
nvidia-smi -a
on your host/etc/docker/daemon.json
)sudo journalctl -r -u kubelet
)Additional information that might help better understand your environment and reproduce the bug:
docker version
uname -a
dmesg
dpkg -l '*nvidia*'
orrpm -qa '*nvidia*'
nvidia-container-cli -V