issues
search
AliyunContainerService
/
gpushare-scheduler-extender
GPU Sharing Scheduler for Kubernetes Cluster
Apache License 2.0
1.39k
stars
308
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
运行了一年后,创建新的 pod 报错 failed bind with extender at URL http://127.0.0.1:32766/gpushare-scheduler/bind, code 500
#229
klvchen
opened
1 week ago
3
节点上有多个GPU时,无法正常分配GPU
#228
hotbaby
opened
1 month ago
1
副本问题
#227
AndrewOYLK
opened
1 month ago
0
k8s上安装好插件,无法识别到集群GPU资源
#226
ferris-cx
opened
2 months ago
1
版本兼容性问题
#225
ferris-cx
opened
2 months ago
0
[AKS] kube-scheduler static POD not running for Aliyun GPU Scheduler Extender
#224
dsatizabal
opened
3 months ago
1
kubelet版本問題
#223
longcheung123
opened
3 months ago
0
方案只能在阿里云上的机器里使用吗
#222
wolgod
opened
3 months ago
1
Remove DeletionTimestamp!=nil condition in IsCompletePod function
#221
zhangbc97
opened
8 months ago
0
ALIYUN_COM_GPU_MEM_IDX in the annotation is different than ALIYUN_COM_GPU_MEM_IDX inside the pod
#220
wokalski
opened
9 months ago
0
这个项目目前在使用过程中存在的问题
#219
freelizhun
opened
9 months ago
0
调度层有bug吧,请求8G,实际设备最大7G,居然最终能创建成功pod
#218
hiahia121
opened
9 months ago
0
关于显存申请基本单位改为MiB但不起作用的问题
#217
harrymore
opened
10 months ago
0
该项目还在维护吗
#216
zhaizhch
opened
10 months ago
0
Support for Horizontal Pod Autoscaling (HPA) with GPU Pods? 是否支持使用GPU Pods的水平Pod自动扩展(HPA)?
#215
tobq
opened
10 months ago
1
feat: adjust k8s to 1.28
#214
Yobol
opened
1 year ago
0
如果一个机器上有两张卡,第一张卡的内存使满了,之后的任务会调度到另一张卡上吗
#213
vicmeng
opened
1 year ago
0
如果想要指定使用两张显卡多卡训练 该怎么做
#212
vicmeng
opened
1 year ago
1
这个GPU共享插件支持使用dcgm-exporter做监控吗
#211
db-root
opened
1 year ago
4
Back-off restarting failed container: gpushare-device-plugin-ds-xxxxx
#210
JiangLingJun
opened
1 year ago
1
你好,kubectl logs这个命令在gpu容器上无效,在普通容器上却可以
#209
140ai
closed
1 year ago
0
GPU cores scheduling / GPU核心调度
#208
valafon
opened
1 year ago
0
plugin does not evenly distribute the pods. 这个插件无法均匀分配Pod。
#207
valafon
opened
1 year ago
2
docs: update install.md
#206
KunWuLuan
closed
1 year ago
0
Not able to use gpushare-scheduler-extender on EKS cluster with Kubernetes v1.24
#205
suchisur
opened
1 year ago
2
优化循环查找可用设备
#204
wangzhipeng
opened
1 year ago
1
Bump golang.org/x/net from 0.1.1-0.20221027164007-c63010009c80 to 0.7.0
#203
dependabot[bot]
opened
1 year ago
0
显存与真实情况不符
#202
SakuraAxy
opened
1 year ago
1
多次进行删除创建Pod之后,会导致新创建Pod出现Pending状态
#201
liufangpeng
opened
1 year ago
0
scheduler-policy-config.yaml文件咨询
#200
liufangpeng
closed
1 year ago
0
使用kubeflow1.6.1 使用自定义镜像有问题
#199
631068264
opened
1 year ago
3
nodeinfo.go allocateGPUID method optimization
#198
wangxiaoyang-dev
opened
1 year ago
0
fix: gpushare concurrent map read write
#197
swartz-k
opened
1 year ago
0
k3s services not started scheduler exited: stat /etc/kubernetes/scheduler.conf: no such file or directory
#196
RotemAmergi
opened
1 year ago
1
fix: circleci version
#195
swartz-k
closed
1 year ago
1
feat: upgrade golang to 1.19 and k8s to 1.25
#194
swartz-k
closed
1 year ago
0
fix: controller process item logic
#193
swartz-k
closed
1 year ago
0
Controller processNextWorkItem return false when err == nil
#192
swartz-k
closed
1 year ago
0
Wrong GPU ID
#191
tintranvan
closed
1 year ago
4
How to share arithmetical force of a gpu?
#190
joeevon
opened
1 year ago
1
trivy image scan lists critical and high vulnerability against latest image k8s-gpushare-schd-extender:1.11-d170d8a
#189
carlwang87
opened
1 year ago
0
gpu pods are in pending states despite of enough gpu resource
#188
mf-giwoong-lee
closed
1 year ago
0
pod运行完成后,插件更新gpu池不及时。当有多个pending的pod排队分配资源时,最后一个pod会一直等到flushUnschedulablePodsLeftover才会重新分配资源
#187
huiyangz
opened
2 years ago
0
读取到了两块显卡,但是请求/gpushare-scheduler/filter后部分容器一直只能调度到其中一块显卡
#186
1003111014
closed
1 year ago
1
pod包含多个container时报错: "unknown device id: no-gpu-has-5MiB-to-run"
#185
serend1p1ty
opened
2 years ago
1
单机双显卡时,调度器显示绑定到了不同的显卡上,实际全部都调度到了一张显卡上
#184
1003111014
opened
2 years ago
1
Update ADOPTERS.md
#183
ftx0day
closed
1 year ago
1
Any instruction/template to help define customized GPU scheduler policy?
#182
blackjack2015
opened
2 years ago
0
Device list strategy - mounts
#181
xhejtman
opened
2 years ago
0
Microk8s installation instructions
#180
agnoam
opened
2 years ago
0
Next