GoogleCloudPlatform/container-engine-accelerators
Collection of tools and examples for managing Accelerated workloads in Kubernetes Engine
Apache License 2.0
211 stars, 150 forks
issues
Update nvidia-driver-installer pull policy for init container
#354
konturn
opened
6 months ago
1
Nvml update - update dependencies from go mod tidy and go mod vendor
#353
aston-github
closed
7 months ago
0
Upgrade nvml version to be compatible with golang:1.22-bullseye
#352
aston-github
closed
7 months ago
0
Vulnerability Fix: Go version update for partition-gpu and gpu-device-plugin images
#351
aston-github
closed
7 months ago
0
Add missing dependency for python3
#350
dulacp
opened
7 months ago
0
Update tcpx nccl installer version
#349
grac3gao
closed
8 months ago
1
NRI device injector skips duplicate devices
#348
Jiaqicao257
closed
7 months ago
0
Add a new nccl-test manifest example
#347
grac3gao
closed
8 months ago
0
Add new years to boilerplate check
#346
Jiaqicao257
closed
8 months ago
0
Add NRI device injector package
#345
Jiaqicao257
closed
8 months ago
3
gpudirect-tcpx: add daemonset for optmem_max
#344
samuelkarp
closed
8 months ago
0
Update Makefile
#343
luisvillarreal
opened
8 months ago
2
Add dependabot.yml to configure schedule and reviewers
#342
aston-github
closed
9 months ago
1
Configure Dependabot schedule and default reviewers
#341
aston-github
closed
9 months ago
0
Fix typo in driver installation script.
#340
YmirKhang
closed
8 months ago
1
Update metrics server manifest
#339
grac3gao
closed
8 months ago
1
Add a new nccl-test manifest includes ENV configuration
#338
grac3gao
closed
8 months ago
1
Update nccl-test manifest
#337
grac3gao
closed
9 months ago
0
Bump up version
#336
grac3gao
closed
9 months ago
0
Update go version for gpu partition image
#335
grac3gao
closed
9 months ago
0
Update dependencies to resolve alerts of potential vulnerabilities
#334
grac3gao
closed
9 months ago
0
tcpx: ensure nccl installer failure propagates
#333
samuelkarp
closed
10 months ago
0
Create NCCL_INSTALL_DIR if not existing
#332
linxiulei
closed
9 months ago
1
Add tcpx metrics server manifest
#331
grac3gao
closed
10 months ago
0
Create Dependabot Configuration to support Docker and go module updates
#330
zhuxiaow0
opened
10 months ago
0
gpudirect-tcpx: update RxDM image and args
#329
samuelkarp
closed
10 months ago
0
tcpx: update RxDM arguments
#328
samuelkarp
closed
10 months ago
0
Bump google.golang.org/grpc from 1.28.1 to 1.56.3
#327
dependabot[bot]
opened
10 months ago
1
Update tcpx manifests
#326
grac3gao
closed
11 months ago
0
Update tcpx manifests
#325
grac3gao
closed
11 months ago
0
Bump golang.org/x/net from 0.0.0-20201110031124-69a78807bb2b to 0.17.0
#324
dependabot[bot]
opened
11 months ago
0
Update nccl-test.yaml
#323
grac3gao
closed
11 months ago
0
Remove check for existing driver modules in driver installer
#322
Jiaqicao257
closed
11 months ago
2
Remove mount update
#321
grac3gao
closed
11 months ago
0
tcpx: update nccl-plugin-gpudirecttcpx image
#320
samuelkarp
closed
11 months ago
0
tcpx: update nccl-plugin-gpudirecttcpx image
#319
samuelkarp
closed
11 months ago
0
gpudirect-tcpx: update test images
#318
samuelkarp
closed
1 year ago
2
gpudirect-tcpx: update installer image tag
#317
samuelkarp
closed
1 year ago
0
Update GPU driver installer daemonset to check for existing driver mo…
#316
Jiaqicao257
closed
1 year ago
0
Update driver installer manifest
#315
grac3gao
closed
1 year ago
1
Update Nvidia driver installer to check for existing drivers
#314
Jiaqicao257
closed
1 year ago
0
Update tcpx manifest
#313
grac3gao
closed
1 year ago
0
Update golang base image
#312
Jiaqicao257
closed
1 year ago
0
Update partition gpus image
#311
Jiaqicao257
closed
1 year ago
0
Bump version to 1.0.24
#310
Jiaqicao257
closed
1 year ago
0
Add MIG support for H100 GPU
#309
Jiaqicao257
closed
1 year ago
2
Update mem limit for MPS
#308
grac3gao
closed
1 year ago
0
Update nccl-plugin manifest
#307
grac3gao
closed
1 year ago
0
mps update for pinned device memory
#306
grac3gao
closed
1 year ago
0
Add TCPX manifest
#305
grac3gao
closed
1 year ago
0