Closed gregwebs closed 5 years ago
pd-0 has a bound cloud disk already, whereas the others do not. It seems as if pd-0 was getting scheduled to a node but then it didn't work out. However, there are no Events for the pd-0 pod (or the pvc or the statefulset). Maybe the pod failed and got re-created?
$ kubectl get pvc -n tidb7 | grep pd
NAME           STATUS    VOLUME                                     CAPACITY   ACCESS MODES   STORAGECLASS   AGE
pd-demo-pd-0   Bound     pvc-dc8019c5-ad4c-11e9-a5c8-4201ac1f4009   5Gi        RWO            pd-ssd-wait    137m
pd-demo-pd-1   Pending                                                                        pd-ssd-wait    137m
pd-demo-pd-2   Pending                                                                        pd-ssd-wait    137m
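For anyone repeating this check, here is a small sketch that pulls out only the claims still in Pending from the `kubectl get pvc` output. The helper name `pending_pvcs` is made up for illustration, and it assumes the default table layout where STATUS is the second whitespace-separated column.

```shell
# List the names of PVCs whose STATUS column reads "Pending".
# Reads `kubectl get pvc` table output on stdin; assumes column 2 is STATUS.
pending_pvcs() {
  awk '$2 == "Pending" { print $1 }'
}

# Usage against a live cluster (namespace taken from this report):
#   kubectl get pvc -n tidb7 | pending_pvcs
```

The header line and Bound claims are skipped because their second column is not `Pending`.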
The probeTime of the pod is not recorded by the scheduler.
$ kubectl -n tidb7 get pods demo-pd-0 -ojsonpath='{.status}{"\n"}'
map[phase:Pending qosClass:Burstable]
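The status above has no conditions at all. As far as I know, when a scheduler tries to place a pod and fails, it normally records a `PodScheduled` condition with status `False`, so an empty result suggests no attempt was recorded. A rough sketch of how one might classify that, where `scheduling_diagnosis` and its labels are hypothetical names of my own:

```shell
# Map the PodScheduled condition status to a short diagnosis.
# An empty string means the pod carries no PodScheduled condition.
scheduling_diagnosis() {
  case "$1" in
    True)  echo "scheduled" ;;
    False) echo "attempted-but-unschedulable" ;;
    "")    echo "no-attempt-recorded" ;;
    *)     echo "unknown" ;;
  esac
}

# Usage against a live cluster (names taken from this report):
#   status="$(kubectl -n tidb7 get pod demo-pd-0 \
#     -o jsonpath='{.status.conditions[?(@.type=="PodScheduled")].status}')"
#   scheduling_diagnosis "$status"
```

For the `demo-pd-0` status shown here, the jsonpath query would presumably return an empty string, which the helper reports as `no-attempt-recorded`.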
You can follow the PR https://github.com/pingcap/tidb-operator/pull/611/files to fix the RBAC permissions of the tidb-scheduler.
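To check whether a permission is actually missing before or after applying that change, `kubectl auth can-i` with impersonation may help. This is only a sketch: the helper name `can_i_as_sa` is made up, and the namespace, service account name, verb, and resource are all things you would need to fill in for your own install (I am not asserting which ones the PR touches).

```shell
# Ask the API server whether a service account may perform VERB on RESOURCE.
# Arguments: NAMESPACE SERVICE_ACCOUNT VERB RESOURCE
# KUBECTL can be overridden (for example with `echo`) for a dry run.
can_i_as_sa() {
  "${KUBECTL:-kubectl}" auth can-i "$3" "$4" \
    --as="system:serviceaccount:$1:$2"
}

# Usage (OPERATOR_NS and SCHEDULER_SA are placeholders you must set yourself):
#   can_i_as_sa "$OPERATOR_NS" "$SCHEDULER_SA" list persistentvolumeclaims
```

Against a real cluster the command prints `yes` or `no`. Note that impersonating a service account requires that your own user is allowed to impersonate.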
Okay, thanks! Why doesn't an error show up in the scheduler when this happens?
We can close this out, but I think it is a bug if these issues don't show up in our logs.
Thank you! One of the worst errors to deal with is silent permission issues. I am surprised that k8s is not already linting for unhandled errors.
Bug Report
This may be the same as #602. pd-0 is Pending and is not seen in the scheduler logs. pd-1 and pd-2 are waiting for pd-0 to be scheduled.
What version of Kubernetes are you using? 1.13.6
What version of TiDB Operator are you using?
After seeing this problem I tried running the latest master and I also tried running rc1. Neither will schedule PD.
Logs
The kube-scheduler log repeats with
tidb-scheduler repeats with