NVIDIA/k8s-device-plugin
NVIDIA device plugin for Kubernetes
Apache License 2.0
2.45k stars
573 forks
issues
Most commented
Support sharing GPUs
#169
ktarplee
opened
4 years ago
38
Requesting zero GPUs allocates all GPUs
#61
dhague
closed
3 months ago
35
Getting nvidia-device-plugin container CrashLoopBackOff | version v0.14.0 | container runtime : containerd
#406
DineshwarSingh
opened
11 months ago
33
Getting GPU device minor number: Not Supported
#332
zengzhengrong
opened
1 year ago
31
Use mps on kubernetes
#467
somelaoda
opened
2 years ago
28
k8s-device-plugin v1.9 deployment CrashLoopBackOff
#16
seekyiyi
closed
6 years ago
27
MPS with Kubernetes on NVIDIA GPU
#443
selinnilesy
opened
7 months ago
25
Pods are not scheduled in all GPUs of a physical server.
#328
shan100github
closed
1 year ago
25
MPS use error: Failed to allocate device vector A (error code all CUDA-capable devices are busy or unavailable)!
#647
lengrongfu
opened
1 month ago
24
How to solve could not load NVML library: libnvidia-ml.so.1
#478
yeeeee7
closed
2 months ago
24
OpenShift 3.9/Docker-CE, Could not register device plugin: context deadline exceeded
#55
DragOnMe
closed
5 years ago
24
0/1 nodes are available: 1 Insufficient nvidia.com/gpu
#33
ernestmartinez
closed
6 years ago
23
How to use the device plugin with new k8s 1.24 version?
#302
Zigko
opened
2 years ago
22
Following the QuickStart but my pod is stuck in pending state
#176
dwschulze
closed
1 month ago
21
Resource type labelling is incomplete/incorrect
#257
anaconda2196
opened
2 years ago
20
Docker image for Nvidia Jetson Nano
#132
vdups
opened
4 years ago
18
Crio integration?
#62
jordimassaguerpla
closed
5 years ago
18
Cannot pass through RTX 3090 into pod; Failed to initialize NVML: could not load NVML library.
#263
davidho27941
opened
2 years ago
17
Error: failed to start container "nvidia-device-plugin-ctr": Error response from daemon: OCI runtime create failed: container_linux.go:349: starting container process caused "process_linux.go:449: container init caused \"process_linux.go:432: running prestart hook 0 caused \\\"error running hook: signal: segmentation fault (core dumped), stdout: , stderr: \\\"\"": unknown
#171
wxitzxg
closed
1 month ago
17
Multiple pods share one GPU
#134
charlieye-dev
closed
4 years ago
17
Device-plugin does not bother to properly do a cleanup of the info about GPUs after MIG enable/disable or after reconfiguration
#240
dchirikov
opened
3 years ago
16
fatalnvml: Insufficient Permissions
#201
qingshanyinyin
closed
1 month ago
16
0/3 nodes are available: 1 PodToleratesNodeTaints, 3 Insufficient nvidia.com/gpu.
#22
bleachzk
closed
6 years ago
16
Unable to install in Ubuntu 20.04 a nvidia container toolkit with version < 1.14.4
#509
AlexisGitHu
closed
3 months ago
15
Device Plugin is not returning with an error, Pod not restarted
#170
zvonkok
closed
3 years ago
15
k8s-device-plugin fails with k8s static CPU policy
#145
johnathanhegge
closed
3 years ago
15
nvidia-device-plugin container CrashLoopBackOff error
#11
WanLinghao
closed
6 years ago
15
k8s-device-plugin restarts on k3s deployment (on top of containerd)
#368
hholst80
opened
1 year ago
14
Failed to initialize NVML: Unknown Error for when changed runtime from docker to containerd
#322
zvier
opened
1 year ago
14
pod fail to find gpu some time after created
#289
JuHyung-Son
closed
2 years ago
14
Advertising specific GPU types as separate extended resource
#424
deepanker-s
opened
10 months ago
13
Unable to get nvidia.com/gpu: "1" greater than 1 for Quadro P2000
#321
brianbrady
closed
1 year ago
13
Installation failed k8s-device-plugin(v0.9.0)
#253
Kwonho
opened
2 years ago
13
Supporting Multi-Instance GPUs (MIG)
#180
klueska
closed
3 years ago
13
NVIDIA device plugin isn't advertising the GPUs
#390
glopezdiest
closed
1 year ago
12
Question about MIG config persistent
#343
slow-zhang
opened
1 year ago
12
K8s 1.26 failed to schedule using GPU-(error code CUDA driver)- could not load NVML library: libnvidia-ml.so.1: cannot
#604
luhong123
closed
1 week ago
11
Using CUDA MPS to enable GPU sharing, the pod occupies all GPU memory.
#569
ysz-github
opened
2 months ago
11
Remove k8s.io replace rules
#529
elezar
closed
2 months ago
11
Bump helm.sh/helm/v3 from 3.13.1 to 3.14.1
#521
dependabot[bot]
closed
3 months ago
11
Container fails to initialize NVML even after setting default docker runtime=nvidia
#182
limwenyao
closed
1 month ago
11
Setting nvidia.com/gpu:NoSchedule taint causes GPU nodes to repel k8s-device-plugin, making GPU nodes unschedulable for GPU jobs
#68
anna-hope
closed
5 years ago
11
what k8s does behind device plugin ?
#37
mingfengwuye
closed
6 years ago
11
Remove duplicated deployment yaml file at root
#648
ArangoGutierrez
closed
1 month ago
10
Plug in does not detect Tegra device Jetson Nano
#377
VladoPortos
opened
1 year ago
10
Use containerd instead of docker for GPU support on kubernetes
#275
DrissiReda
closed
2 years ago
10
K8s cannot schedule pod on full GPU when some other GPUs are MIG enabled
#260
nacef-labidi
closed
3 months ago
10
device plugin start with --device-list-strategy=volume-mounts, pod create error : nnvidia-container-cli: device error: unknown device id: /var/run/nvidia-container-devices\\\\n\\\"\"": unknown
#200
xqqcoder
closed
1 month ago
10
OutOfnvidia.com/gpu error appeared after "Instance terminated during maintenance" operation in GCE. "nvidia-container-cli: device error: unknown device id"
#98
dkozlov
closed
1 month ago
10
How to resolve kubelet error after installing nvidia device driver plugin in k8s worker nodes, getting nvidia-smi: dial tcp 172.28.3.100:10250: getsockopt: no route to host
#72
Bharathkumarraju
closed
5 years ago
10