GoogleCloudPlatform / ai-on-gke

AI on GKE is a collection of examples, best-practices, and prebuilt solutions to help build, deploy, and scale AI Platforms on Google Kubernetes Engine
Apache License 2.0
186 stars 140 forks source link

Fix jetstream inference servers #678

Closed francescov1 closed 1 month ago

francescov1 commented 1 month ago

Small fixes to support recent changes in the Jetstream repos for the inference server pytorch example.

This fix will likely break maxtext since the http-server is shared by both repos. Either need to update maxtext or split out the http server for each implementation.

google-cla[bot] commented 1 month ago

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

vivianrwu commented 1 month ago

/gcbrun