Open LiaoSirui opened 10 months ago
遇到调度失败的问题,有大佬能给点排查思路吗
环境信息:
# helm list -n crane-system
NAME NAMESPACE REVISION UPDATED STATUS CHART APP VERSION
scheduler crane-system 3 2023-11-21 10:12:59.63159177 +0800 CST deployed scheduler-0.2.2 0.2.2
# helm get values -n crane-system scheduler
USER-SUPPLIED VALUES:
controller:
enable: true
image:
repository: dockerhub.bigquant.ai:5000/aipaas-devops/3rdparty/docker.io/gocrane/crane-scheduler-controller
tag: 0.0.24
name: crane-scheduler-controller
replicaCount: 3
global:
prometheusAddr: http://kube-prometheus-kube-prome-prometheus.monitoring.svc.cluster.local:9090
scheduler:
enable: true
image:
repository: dockerhub.bigquant.ai:5000/aipaas-devops/3rdparty/docker.io/gocrane/crane-scheduler
tag: 0.0.23
name: crane-scheduler
replicaCount: 3
# kubectl version
Client Version: version.Info{Major:"1", Minor:"25", GitVersion:"v1.25.10", GitCommit:"e770bdbb87cccdc2daa790ecd69f40cf4df3cc9d", GitTreeState:"clean", BuildDate:"2023-05-17T14:12:20Z", GoVersion:"go1.19.9", Compiler:"gc", Platform:"linux/amd64"}
Kustomize Version: v4.5.7
Server Version: version.Info{Major:"1", Minor:"25", GitVersion:"v1.25.10", GitCommit:"e770bdbb87cccdc2daa790ecd69f40cf4df3cc9d", GitTreeState:"clean", BuildDate:"2023-05-17T14:06:35Z", GoVersion:"go1.19.9", Compiler:"gc", Platform:"linux/amd64"}
若为替换方式安装,/etc/kubernetes/manifests/kube-scheduler.yaml
中设置 scheduler 命令行参数:--leader-elect=true
,参考 https://github.com/gocrane/crane-scheduler/blob/c2c05338a5d75c0a6d92bd16a1cf257b48b30ef8/deploy/scheduler/deployment.yaml#L33
调度失败