NVIDIA/gpu-operator
NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes
Apache License 2.0
1.25k
stars
238
forks
issues
Update vgpu device manager config for vGPU 17.2
#799
cdesiniotis
opened
10 hours ago
0
How to configure dcgm metrics for MIG?
#798
laszlocph
opened
1 day ago
0
Adding transformation for kata-manager daemonset for supporting CRI-O
#797
visheshtanksale
opened
1 day ago
0
Bump dcgm to 3.3.6-1 and dcgm-exporter to 3.3.6-3.4.2
#796
cdesiniotis
closed
1 day ago
0
Cherry-pick Github Actions to release-24.3 branch
#795
cdesiniotis
closed
1 day ago
0
Update kubevirt-gpu-device-plugin to v1.2.8
#794
visheshtanksale
closed
1 day ago
0
standardise object hash generation across the repo
#793
tariq1890
closed
1 day ago
0
Cherry pick vfio graphics
#792
jojimt
closed
1 day ago
0
bump the rest of the gpu-operator dependencies
#791
tariq1890
closed
1 day ago
0
Create driver-ready file atomically
#790
cdesiniotis
closed
1 day ago
0
Create driver-ready file atomically
#789
elezar
closed
1 day ago
1
bump helm client to v0.12.10
#788
tariq1890
closed
2 days ago
2
How to query the validation result using api?
#787
chenditc
opened
3 days ago
0
update trunk image ref in OLM bundle
#786
tariq1890
closed
3 days ago
0
Bump k8s.io/apiextensions-apiserver from 0.30.1 to 0.30.2 in the k8sio group across 1 directory
#785
dependabot[bot]
closed
2 days ago
0
bump k8s.io and controller-runtime dependencies
#784
tariq1890
closed
3 days ago
0
Bump mig-manager to v0.8.0-rc.1
#783
cdesiniotis
closed
3 days ago
0
nvidia driver daemonset pod is recreated whenever there is an nfd restart
#782
charanteja333
opened
3 days ago
3
nvidia-driver-daemonset restarts continuously
#781
Hokwang
opened
4 days ago
1
[MIG] add support for H200 141GB
#780
tariq1890
closed
4 days ago
0
Fix bug in validator/metrics.go for driver validation
#779
cdesiniotis
closed
4 days ago
0
Issue with the nvidia-device-plugin-daemonset error mounting /run/nvidia/driver/usr/lib/x86_64-linux-gnu/libnvidia-egl-gbm.so.1.1.1
#778
adwiza
opened
1 week ago
0
Exclude attestation manifests (sbom, provenance) during the build usi…
#777
shivakunv
opened
1 week ago
0
[operator-validator] remove redundant daemonset kube get calls
#776
tariq1890
closed
1 week ago
0
Bump container toolkit and device plugin to latest release candidates
#775
cdesiniotis
closed
1 week ago
0
Mount the host's /dev into the mig-manager container
#774
cdesiniotis
closed
4 days ago
1
Bump CUDA base image used by operands to 12.5.0
#773
cdesiniotis
closed
1 week ago
0
"leader election lost" gpu-operator pod restarts
#772
alnhk
opened
2 weeks ago
0
Bump github.com/docker/docker from 24.0.7+incompatible to 24.0.9+incompatible
#771
dependabot[bot]
closed
2 days ago
1
Bump golang.org/x/net from 0.22.0 to 0.23.0
#770
dependabot[bot]
closed
3 days ago
1
Bump golang.org/x/net from 0.22.0 to 0.23.0
#769
dependabot[bot]
closed
3 days ago
1
Bump github.com/NVIDIA/k8s-kata-manager from 0.0.0-20230620232711-08b57feb9b5a to 0.2.0
#768
dependabot[bot]
closed
1 day ago
0
Bump github.com/urfave/cli/v2 from 2.27.1 to 2.27.2
#767
dependabot[bot]
closed
2 days ago
1
Bump github.com/operator-framework/api from 0.23.0 to 0.26.0
#766
dependabot[bot]
closed
1 day ago
2
Bump the k8sio group with 4 updates
#765
dependabot[bot]
closed
3 days ago
1
Bump nvidia/cuda from 12.4.1-base-ubi8 to 12.5.0-base-ubi8 in /validator
#764
dependabot[bot]
closed
1 week ago
0
Bump nvidia/cuda from 12.4.1-base-ubi8 to 12.5.0-base-ubi8 in /docker
#763
dependabot[bot]
closed
1 week ago
0
Bump golangci/golangci-lint-action from 5 to 6
#762
dependabot[bot]
closed
2 weeks ago
0
/sys/module/firmware_class/parameters/path: Read-only file system -- nvidia-vgpu-manager-daemonset
#761
davidhwua
opened
2 weeks ago
0
Add dependabot config
#760
cdesiniotis
closed
2 weeks ago
1
Failed to create pod sandbox: rpc error: code = Unknown desc = failed to get sandbox runtime: no runtime for “nvidia” is configured
#759
chary1112004
opened
2 weeks ago
0
feat: vfio-manager graphics mode
#758
jojimt
closed
3 days ago
0
feat: vfio-manager graphics mode
#757
jojimt
closed
2 weeks ago
0
Graphics mode
#756
jojimt
closed
2 weeks ago
0
Error: failed to create containerd task: failed to create shim task: OCI runtime create failed
#755
pythonking6
opened
2 weeks ago
0
User "system:serviceaccount:default:gpu-operator" cannot list resource "daemonsets" in API group "apps" at the cluster scope with Helm Template
#754
Li357
opened
2 weeks ago
0
bump gpu drivers to 470.256.02, 535.183.01 and 550.90.07
#753
tariq1890
closed
2 weeks ago
0
No install errors and `nvidia-smi` works in gpu-operator pods but "Insufficient nvidia.com/gpu" error when using nvidia with any other pod
#752
v1nsai
closed
2 weeks ago
1
Enabling gpu on microk8s, pod/nvidia-driver-daemonset restart many times at status CrashLoopBackOff
#751
haiph-dev
opened
2 weeks ago
2
[RBAC] move namespace-scoped resource permissions to Roles
#750
tariq1890
closed
1 week ago
0