GoogleCloudPlatform / ai-on-gke

AI on GKE is a collection of examples, best practices, and prebuilt solutions to help build, deploy, and scale AI platforms on Google Kubernetes Engine.
Apache License 2.0
186 stars 140 forks

Enable Ray Autoscaler for the RAG example application #722

Closed gongmax closed 21 hours ago

gongmax commented 1 week ago

Enable the Ray Autoscaler for the RAG example application. Tested on both a GKE Autopilot cluster and a Standard cluster with Cluster Autoscaler.
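
For readers unfamiliar with the setting, here is a minimal sketch of what turning on the Ray Autoscaler usually looks like in a KubeRay `RayCluster` resource. It is not taken from this PR's diff, which is not shown here. The helper name `raycluster_with_autoscaler`, the group name `workers`, the cluster name, and the 60-second idle timeout are all illustrative assumptions, and the pod templates are omitted, so the output is not a complete manifest.

```python
import json


def raycluster_with_autoscaler(name, min_workers, max_workers):
    """Build a partial KubeRay RayCluster manifest with autoscaling enabled.

    Hypothetical helper for illustration; pod templates are left out.
    """
    if not 0 <= min_workers <= max_workers:
        raise ValueError("need 0 <= min_workers <= max_workers")
    return {
        "apiVersion": "ray.io/v1",
        "kind": "RayCluster",
        "metadata": {"name": name},
        "spec": {
            # Asks KubeRay to run the Ray Autoscaler alongside the head node.
            "enableInTreeAutoscaling": True,
            # Assumed value: scale idle workers down after 60 seconds.
            "autoscalerOptions": {"idleTimeoutSeconds": 60},
            "headGroupSpec": {"rayStartParams": {}},  # pod template omitted
            "workerGroupSpecs": [
                {
                    "groupName": "workers",  # assumed group name
                    "replicas": min_workers,
                    # The autoscaler keeps the worker count within these bounds.
                    "minReplicas": min_workers,
                    "maxReplicas": max_workers,
                    "rayStartParams": {},  # pod template omitted
                }
            ],
        },
    }


if __name__ == "__main__":
    print(json.dumps(raycluster_with_autoscaler("rag-cluster", 0, 4), indent=2))
```

When Ray requests more worker pods than the current nodes can hold, the pending pods are what prompt GKE (Autopilot, or Cluster Autoscaler on Standard) to add nodes, which is presumably why both cluster types were tested.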

gongmax commented 1 week ago

/gcbrun

gongmax commented 2 days ago

/gcbrun

gongmax commented 2 days ago

/gcbrun

roberthbailey commented 2 days ago

Both of the presubmit tests failed. Can you investigate and see if the failures are related to your PR?

gongmax commented 1 day ago

/gcbrun

gongmax commented 1 day ago

> Both of the presubmit tests failed. Can you investigate and see if the failures are related to your PR?

It looks like the failures were flaky and not related to the PR. The retry succeeded.