NVIDIA/gpu-operator
NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes
Apache License 2.0
1.25k stars
237 forks
issues
[RBAC] move namespace-scoped resource permissions to Roles
#750
tariq1890
closed
1 week ago
0
[CC Mgr VGPU Device Mgr] move pods access permissions from ClusterRole to Role
#749
tariq1890
closed
2 weeks ago
0
Failed to fetch https://helm.ngc.nvidia.com/nvidia/index.yaml : 401 Unauthorized
#748
gopricy
closed
2 weeks ago
8
Add rootFS and driverInstallDir fields to ClusterPolicy
#747
cdesiniotis
closed
1 week ago
0
[OCP] restrict RBAC perms of gpu-operator in OLM bundle
#746
tariq1890
closed
2 weeks ago
0
Make kubelet path configurable in dcgm-exporter
#745
aishwaryaraimule21
opened
3 weeks ago
0
[OCP] add back permissions to set finalizers on CRDs
#744
tariq1890
closed
3 weeks ago
0
Add Env valueFrom function
#743
mayooot
opened
3 weeks ago
0
label to nodes are too much, especially label non-gpu nodes.
#742
johnzheng1975
opened
3 weeks ago
0
Add H20 to default mig-manager config
#741
cdesiniotis
closed
2 weeks ago
0
[CI] specify crictl version in holodeck yaml config
#740
tariq1890
closed
2 weeks ago
0
add ngc signing job for auto signing
#739
shivakunv
closed
6 days ago
2
Rename master to main
#738
elezar
closed
3 weeks ago
0
[RBAC] Remove unnecessary permissions from the gpu-operator app
#737
tariq1890
closed
3 weeks ago
0
Upload e2e test artifacts with different names
#736
cdesiniotis
closed
3 weeks ago
0
Bump NFD to v0.16.0
#735
cdesiniotis
opened
3 weeks ago
1
Add MPS test to e2e test suite
#734
cdesiniotis
opened
3 weeks ago
0
Enable the use of CDI on OpenShift
#733
cdesiniotis
opened
3 weeks ago
1
Migrate from ClusterPolicy to NVIDIADriver owned driver daemonsets
#732
cdesiniotis
opened
3 weeks ago
0
Ensure CDI specs do not contain duplicate driver firmware files
#731
cdesiniotis
opened
3 weeks ago
0
no runtime for "nvidia" is configured
#730
yanis-incepto
opened
4 weeks ago
7
AKS node with deallocated, gpu drivers can't be installed anymore after ~10 restarts
#729
Johannesm299
opened
4 weeks ago
1
Installation result: some daemonset not be installed, some of them install too much
#728
johnzheng1975
opened
1 month ago
0
Adding volume mount to sandbox DP to support GPU healthcheck
#727
visheshtanksale
closed
3 weeks ago
0
k8s 1.22.10, helm install nvidia-operator, raise error: unknown field "grpc" in io.k8s.api.core.v1.Probe]
#726
sycbbyes
opened
1 month ago
4
Use NVIDIADriver CRD to install GPU driver in a centos7 and a ubuntu22.04
#725
lengrongfu
opened
1 month ago
1
unknown field "grpc" in io.k8s.api.core.v1.Probe
#724
corrtia
closed
1 month ago
3
NVIDIA GPU Operator 24.23.0 Failed on OCP 4.14.23 Cluster
#723
habjouqa
opened
1 month ago
3
Ubuntu 24.04 Image Missing For nvidia-driver-daemonset
#722
isugimpy
opened
1 month ago
3
Add Github Actions
#721
cdesiniotis
closed
3 weeks ago
5
Update 0500_configmap.yaml to support L40S vGPU
#720
jxdn
opened
1 month ago
4
Update for L40S vGPU Support
#719
jxdn
opened
1 month ago
1
GPU drivers not installing with host kernel 6.8 and vGPU 16.5 (535.161.05)
#718
urbaman
opened
1 month ago
7
unable to enable MPS strategy
#717
thien-lm
opened
1 month ago
0
GPU operator has the compatible issue with pre-default driver of VMSS
#716
zhangchl007
closed
1 month ago
1
When using a precompiled driver and all gpu nodes are not ready, gpu-operator will loop to deleted and recreated `nvidia-driver-daemonset`
#715
Levi080513
opened
1 month ago
6
VirtualGL with NVIDIA GPU Operator in EKS (Invalid EGL device)
#714
Mohamed-ben-khemis
opened
1 month ago
1
Cannot enable GDRcopy using Nvidia driver CRD due to wrong indentation in 0500_daemonset.yaml
#713
age9990
closed
1 month ago
2
containerd restarts at least once an hour
#712
tatodorov
opened
1 month ago
2
Unable to install dcgm, dcgm-exporter in kubevirt vm-passthrough mode
#711
rokkiter
opened
2 months ago
1
Allow adding custom labels to the "gpu-operator" ServiceMonitor
#710
peihsuant
opened
2 months ago
3
In GPU operator v23.9.2 driver Image is missing for bottlerocket1.19.2
#709
OS-walidslim
closed
1 month ago
1
Issue with autoscaler scheduling
#708
Jasper-Ben
opened
2 months ago
2
"nvidia-smi": executable file not found in $PATH: unknown
#707
zlianzhuang
opened
2 months ago
1
DCGM-Exporter cannot access configmap, access denied
#706
Bromhir84
closed
1 month ago
3
Driver daemonset uninstall the driver on node reboot even if no new version is available
#705
slik13
opened
2 months ago
2
Add nodeSelect to ClusterPolicySpec CRD to select different MIG strategy in different node
#704
lengrongfu
closed
2 months ago
4
After the GPU node is restarted, an error occurs when the nvidia-driver-daemonset pod is started in the offline environment
#703
sunwuyan
opened
2 months ago
4
InitContainers have non-configurable and explicitly empty resources
#702
miguelglopes
opened
2 months ago
1
nvidia.com/gpu.deploy.mig-manager label not delete
#701
lengrongfu
closed
2 months ago
7