CentaurusInfra / arktos

Arktos for large-scale cloud platform
Apache License 2.0
245 stars 69 forks source link

kube-controller-manager/workload-controller-manager on admin cluster will crash when running 5000 hollow-nodes pods #95

Open sonyafenge opened 4 years ago

sonyafenge commented 4 years ago

What happened: using kubemark to simulate 5000 nodes, kube-controller-manager/workload-controller-manager on admin cluster will crash when running 5000 hollow-nodes pods

What you expected to happen: no crash on admin cluster How to reproduce it (as minimally and precisely as possible):

  1. run the commands below to start admin cluster and kubemark cluster
    export MASTER_DISK_SIZE=750GB KUBE_GCE_ZONE=us-west2-b MASTER_SIZE=n1-highmem-96 KUBE_GCE_NETWORK=default
    ./cluster/kube-up.sh
    export MASTER_DISK_SIZE=750GB KUBE_GCE_ZONE=us-west2-b MASTER_SIZE=n1-highmem-96 KUBE_GCE_NETWORK=kubemark
    ./test/kubemark/start-kubemark.sh
  2. start 5000 hollow nodes
  3. check admin cluster status and found kube-controller-manager/workload-controller-manager will crash around 500 nodes. Anything else we need to know?:

Environment:

sonyafenge commented 4 years ago

image image