issues
search
NVIDIA
/
k8s-dra-driver
Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes
Apache License 2.0
263
stars
49
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Local Testing (with GTX1080) - error creating driver: failed to create device library: failed to locate driver libraries: error locating "libnvidia-ml.so.1"
#204
wenzel-felix
opened
2 days ago
1
Bump github.com/NVIDIA/nvidia-container-toolkit from 1.16.1 to 1.17.1
#203
dependabot[bot]
opened
4 days ago
0
Bump golang.org/x/sys from 0.26.0 to 0.27.0
#202
dependabot[bot]
opened
4 days ago
0
Update all demo scripts for use on GKE with a k8s 1.31 alpha cluster
#201
klueska
opened
1 week ago
1
Bump github.com/NVIDIA/nvidia-container-toolkit from 1.16.1 to 1.17.0
#200
dependabot[bot]
closed
4 days ago
1
Bump github.com/NVIDIA/nvidia-container-toolkit from 1.16.1 to 1.16.2
#199
dependabot[bot]
opened
2 weeks ago
0
Add explicit envvar to control if we mask /proc/driver/nvidia/params
#198
klueska
closed
2 weeks ago
0
Properly handle graceful shutdown of the kubelet plugin
#197
klueska
closed
2 weeks ago
0
Fix regression with supporting operator managed drivers
#196
klueska
closed
2 weeks ago
0
Fix incorrect error string
#195
klueska
closed
2 weeks ago
0
Update gpu-test6 to adhere to new DRA APIs
#194
klueska
closed
2 weeks ago
0
Fix regression with managing claim-specific CDI devices
#193
klueska
closed
2 weeks ago
0
Bump github.com/urfave/cli/v2 from 2.27.2 to 2.27.5
#192
dependabot[bot]
closed
2 weeks ago
0
Bump github.com/prometheus/client_golang from 1.19.1 to 1.20.5
#191
dependabot[bot]
opened
2 weeks ago
0
Bump github.com/NVIDIA/go-nvlib from 0.6.0 to 0.7.0
#190
dependabot[bot]
closed
2 weeks ago
0
Refactor the IMEX controller code
#189
klueska
closed
2 weeks ago
0
Fix bug in deploying to kind without GFD
#188
klueska
closed
2 weeks ago
0
Ensure each imex domain.cliqueId has a unique set of channel numbers
#187
ArangoGutierrez
closed
2 weeks ago
2
Clean up resourceSlices during exit
#186
ArangoGutierrez
closed
3 weeks ago
4
Add support to selectively decide which device classes to support via helm
#185
klueska
closed
3 weeks ago
0
Update how cross-compiling works to speed things up
#184
klueska
closed
3 weeks ago
0
Support for allocating GPUs in Passthrough-Mode
#183
varunrsekar
opened
4 weeks ago
1
Add a MIG example
#182
yuanchen8911
opened
4 weeks ago
0
Update the MIG+TS example withe the new apiVersion
#181
yuanchen8911
closed
4 weeks ago
0
Sort UUIDs of prepared and allocatable devices for comparisons
#180
klueska
closed
1 month ago
0
Add function to discover IMEX major number rather than hard-coding it
#179
klueska
closed
1 month ago
0
Add support for a centralized controller to advertise IMEX channel
#178
klueska
closed
1 month ago
1
Does the nvidia dra-driver support structed parameter DRA?
#177
fj-zhang-lei
closed
1 month ago
1
Bump github.com/NVIDIA/nvidia-container-toolkit from 1.16.1 to 1.16.2
#176
dependabot[bot]
closed
1 week ago
1
Update the MPS example in quickstart with the latest apiVersion
#175
yuanchen8911
closed
2 weeks ago
0
NVlink support
#174
ritazh
opened
1 month ago
1
Add imex-resourceslice.yaml to both nvkind and kind
#173
klueska
closed
1 month ago
0
Add IMEX support in the kubelet plugin
#172
klueska
closed
1 month ago
0
Add labels to divide multi-node GPU demo into 2 separate IMEX domains
#171
klueska
closed
1 month ago
0
Add new nvkind target for demo/clusters with multinode GPUs support
#170
klueska
closed
1 month ago
0
Force mask of /proc/driver/nvidia/params to prevent /dev node creation
#169
klueska
closed
1 month ago
0
Use CDI to inject GPUs into the kind workers instead of 'legacy' mode
#168
klueska
closed
1 month ago
0
Update all demos so that their PID 1 processes explicitly catch SIGTERM
#167
klueska
closed
1 month ago
0
Working with dcgm-exporter
#166
ritazh
opened
1 month ago
5
Does dra-driver need a resource controller ?
#165
AllenXu93
closed
1 month ago
3
README: Update the expectations after installation
#164
surajssd
closed
1 month ago
0
Replace cuda-sample image for nbody with older CUDA image
#163
klueska
closed
2 months ago
0
Set NVIDIA_VISIBLE_DEVICES to void to bypass nvidia-container-runtime
#162
klueska
closed
2 months ago
0
Fix name of repo for pushing to github registry
#161
klueska
closed
2 months ago
0
Update github workflow for image builds to the latest standard one
#160
klueska
closed
2 months ago
0
Fix bug with MIG devices introduced when adding opaque config support
#159
klueska
closed
2 months ago
0
Bump github.com/opencontainers/runc from 1.1.13 to 1.1.14
#158
dependabot[bot]
closed
2 weeks ago
2
Add support for applying opaque configs for Time-Slicing and MPS on both full GPUs and MIG devices
#157
klueska
closed
2 months ago
0
Add static MIG support for the DRA APIs available in 1.31
#156
klueska
closed
2 months ago
0
Bump github.com/prometheus/client_golang from 1.19.0 to 1.20.3
#155
dependabot[bot]
closed
2 weeks ago
1
Next