issues
search
NVIDIA
/
gpu-operator
NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
https://docs.nvidia.com/datacenter/cloud-native/gpu-operator/latest/index.html
Apache License 2.0
1.86k
stars
303
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Bump sigs.k8s.io/controller-runtime from 0.19.1 to 0.19.2
#1134
dependabot[bot]
opened
10 hours ago
1
Bump github.com/prometheus-operator/prometheus-operator/pkg/apis/monitoring from 0.78.1 to 0.78.2
#1133
dependabot[bot]
opened
10 hours ago
1
[GitHub Actions] inject golangci lint version at the right stage
#1132
tariq1890
closed
22 hours ago
0
Add support for SUSE SLE-Micro
#1131
e4t
opened
1 day ago
1
Bump github.com/onsi/ginkgo/v2 from 2.21.0 to 2.22.0
#1130
dependabot[bot]
closed
19 hours ago
2
Bump the k8sio group with 4 updates
#1129
dependabot[bot]
opened
1 day ago
1
Bump k8s.io/code-generator from 0.31.2 to 0.31.3 in /tools
#1128
dependabot[bot]
opened
1 day ago
1
[release-24.9] cherrypick changes for release 24.9.1
#1127
tariq1890
closed
1 day ago
0
bump driver versions to 550.127.08 and 535.216.03
#1126
tariq1890
closed
1 day ago
0
Bump github.com/regclient/regclient from 0.7.1 to 0.7.2
#1125
dependabot[bot]
closed
1 day ago
2
update golangci-lint version to v1.62.0 and pass in version via GITHUB_ENV
#1124
tariq1890
closed
1 day ago
0
[release-24.9] cherrypick H200 NVL MIG and container-toolkit 1.17.2 changes
#1123
tariq1890
closed
1 day ago
0
bump dcgm to 3.3.9 and dcgm-exporter to 3.3.9-3.6.1 versions
#1122
tariq1890
closed
2 days ago
0
Bump github.com/NVIDIA/nvidia-container-toolkit from 1.17.0 to 1.17.2
#1121
dependabot[bot]
closed
2 days ago
3
Bump golang.org/x/mod from 0.21.0 to 0.22.0
#1120
dependabot[bot]
closed
2 days ago
2
[Feature Request] Allow to configure CDI annotation prefix
#1119
xiongzubiao
opened
3 days ago
0
Nvidia GPU-Operator helm chart version does not reflect the branch tags
#1118
yonkatian
closed
1 day ago
2
[Feature] Support a custom e2e validation
#1117
changhyuni
opened
5 days ago
1
discovery-worker can't be ready. failed to read cpufreq directory
#1116
loprx
opened
5 days ago
1
[mig-manager] add support for H200 NVL
#1115
tariq1890
closed
2 days ago
2
bug: operator anti-pattern, validator pod deployments cause `CrashBackLoop` behaviour
#1114
justinthelaw
opened
1 week ago
0
CentOS 7.9 nvidia-operator-validator need GLIBC version: 2.27,default version: 2.1.7
#1113
einherjar9527
closed
2 days ago
0
bump golang to 1.23.3
#1112
tariq1890
closed
3 days ago
0
enable hostPID in mps-control-daemon
#1111
tariq1890
closed
3 days ago
1
Update NV-GHA IP Ranges
#1110
ArangoGutierrez
closed
2 weeks ago
0
container-toolkit fails to start after upgrading to v24.9.0 on k3s cluster
#1109
logan2211
opened
2 weeks ago
6
[nit] typo fix
#1108
BrainGithub
opened
2 weeks ago
1
Bump github.com/prometheus-operator/prometheus-operator/pkg/apis/monitoring from 0.76.2 to 0.78.1
#1107
dependabot[bot]
closed
3 days ago
8
[release-24.9] drop the distro-specific tag suffix from the device-plugin image
#1106
tariq1890
closed
2 weeks ago
0
drop the distro-specific tag suffix from the device-plugin image
#1105
tariq1890
closed
2 weeks ago
0
[release-24.9] move permissions for events from Role to ClusterRole
#1104
tariq1890
closed
2 weeks ago
0
[release-24.9] cleanup redundant sign:ngc jobs and fix bug in release:ngc job
#1103
tariq1890
closed
2 weeks ago
0
move permissions for events from Role to ClusterRole
#1102
tariq1890
closed
2 weeks ago
1
gpu-operator does not have permissions to create 'GPUDriverUpgrade' events
#1101
ein-stein-chen
closed
2 weeks ago
1
great
#1100
MatrixDCoder
closed
2 weeks ago
0
Upgrading gpu-operator on Rancher RKE2 results in nvidia-container-toolkit-daemonset failing to initialize
#1099
nikito
closed
3 weeks ago
2
Bump github.com/onsi/gomega from 1.35.0 to 1.35.1
#1098
dependabot[bot]
closed
2 days ago
3
[release-24.9] cleanup redundant sign:ngc jobs and fix bug in release:ngc job
#1097
tariq1890
closed
2 weeks ago
0
cleanup redundant sign:ngc jobs and fix bug in release:ngc job
#1096
tariq1890
closed
3 weeks ago
0
Revert "Revert "Use NV-GHA runners""
#1095
cdesiniotis
closed
2 weeks ago
0
[release-24.9] add 24.9.0 OLM bundle
#1094
tariq1890
closed
3 weeks ago
0
add 24.9.0 OLM bundle
#1093
tariq1890
closed
3 weeks ago
0
Helm release for v24.9.0
#1092
tariq1890
closed
3 weeks ago
2
Bump k8s-device-plugin, gpu-feature-discovery and node-feature-discovery versions
#1091
tariq1890
closed
3 weeks ago
0
Bump github.com/NVIDIA/nvidia-container-toolkit from 1.17.0-rc.2 to 1.17.0
#1090
dependabot[bot]
closed
3 weeks ago
1
Bump github.com/prometheus-operator/prometheus-operator/pkg/apis/monitoring from 0.76.2 to 0.78.0
#1089
dependabot[bot]
closed
2 weeks ago
2
add support for driver 565.57.01
#1088
tariq1890
closed
3 weeks ago
0
NvOpera-bixspac3
#1087
Lorcon1
closed
3 weeks ago
1
[Feature Request] Add hostNetwork mode for dcgmExporter
#1086
jslouisyou
opened
3 weeks ago
1
why nvidia-xconfig is disabled from libnvidia-config
#1085
yuclassic
opened
3 weeks ago
0
Next