AI on GKE is a collection of examples, best practices, and prebuilt solutions to help build, deploy, and scale AI platforms on Google Kubernetes Engine.
Apache License 2.0
Jetstream Maxtext Deployment Module: All scale rules now in a single HPA #730
Multiple HPAs targeting the same resource race each other, because each one sets the replica count independently. Keeping all the scale rules in a single HPA fixes this.
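
As a rough illustration of the change, the sketch below builds one `autoscaling/v2` HorizontalPodAutoscaler manifest that carries every scale rule in a single `metrics` list. It is not the module's actual code, which may express this differently: the helper `build_hpa`, the Deployment name `maxtext-server`, the metric names, and the target values are all hypothetical placeholders.

```python
import json


def build_hpa(name, target_name, min_replicas, max_replicas, rules):
    """Build one autoscaling/v2 HPA manifest that holds every scale rule.

    `rules` maps a per-pod metric name to its target average value.
    """
    metrics = [
        {
            "type": "Pods",
            "pods": {
                "metric": {"name": metric_name},
                "target": {"type": "AverageValue", "averageValue": str(average)},
            },
        }
        for metric_name, average in rules.items()
    ]
    return {
        "apiVersion": "autoscaling/v2",
        "kind": "HorizontalPodAutoscaler",
        "metadata": {"name": name},
        "spec": {
            "scaleTargetRef": {
                "apiVersion": "apps/v1",
                "kind": "Deployment",
                "name": target_name,
            },
            "minReplicas": min_replicas,
            "maxReplicas": max_replicas,
            # All rules live here, so only one controller loop scales the target.
            "metrics": metrics,
        },
    }


# Hypothetical names and thresholds, for illustration only.
hpa = build_hpa(
    name="maxtext-server-hpa",
    target_name="maxtext-server",
    min_replicas=1,
    max_replicas=5,
    rules={"example_queue_depth": 10, "example_slot_utilization": 75},
)
print(json.dumps(hpa, indent=2))
```

When a single HPA lists several metrics, Kubernetes computes a desired replica count for each metric and uses the largest, so the rules no longer overwrite each other's decisions.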