AliyunContainerService / gpushare-scheduler-extender

GPU Sharing Scheduler for Kubernetes Cluster
Apache License 2.0
1.36k stars 303 forks source link

Not able to use gpushare-scheduler-extender on EKS cluster with Kubernetes v1.24 #205

Open suchisur opened 1 year ago

suchisur commented 1 year ago

With refernce to https://github.com/AliyunContainerService/gpushare-scheduler-extender/issues/145 , there has been a olution given to use the extender in EKS, however the solution only works for Kubernetes v1.23 and below. Since kubernetes v1.24, there has been a removal of scheduling policies , they are no longer supported(https://kubernetes.io/docs/reference/scheduling/policies/) instead scheduler configurations(https://kubernetes.io/docs/reference/scheduling/config/) should be used. Have tried using the new KubeSchedulerConfiguration by editing the configmap. The image has changed as well, and the pods do not come up. Any help would be appreciated

KunWuLuan commented 1 year ago

Hi, you can modify the schedule framework config like the following:

---
apiVersion: kubescheduler.config.k8s.io/v1beta2
kind: KubeSchedulerConfiguration
clientConnection:
  kubeconfig: /etc/kubernetes/scheduler.conf
extenders:
- urlPrefix: "http://127.0.0.1:32766/gpushare-scheduler"
  filterVerb: filter
  bindVerb: bind
  enableHTTPS: false
  nodeCacheCapable: true
  managedResources:
  - name: aliyun.com/gpu-mem
    ignoredByScheduler: false
  ignorable: false

And add policy config file parameter in scheduler arguments like this: --config=/etc/kubernetes/schedule-framework-config.yaml

YuuinIH commented 1 year ago

https://github.com/AliyunContainerService/gpushare-scheduler-extender/issues/145#issuecomment-1480903615