celestiaorg / test-infra

Testing infrastructure for the Celestia Network
Apache License 2.0
25 stars 10 forks source link

testground/daemon: cluster-autoscaler improvements #153

Closed Bidon15 closed 1 year ago

Bidon15 commented 1 year ago

ATM, the daemon is starting up the pods and throwing errors during scaling up of the cluster's instances to accommodate the load given to it.

This can result in unstable behaviours like:

In order to fix both issues above, it might be beneficial for the daemon to know upfront when the scaling has been complete to start spinning up pods for test run

smuu commented 1 year ago

https://www.notion.so/celestiaorg/DevOps-Berlin-May-June-mini-onsite-da1c4795d16d4bedba741b93e6e41d0c#bf200c5ae55a43a2aba9f954c7b8eb24