GoogleCloudPlatform / ai-on-gke

AI on GKE is a collection of examples, best-practices, and prebuilt solutions to help build, deploy, and scale AI Platforms on Google Kubernetes Engine
Apache License 2.0
186 stars 140 forks source link

Bump Jetstream, maxtext, jetstream-pytorch versions in Jetstream inference server guide #695

Closed Bslabe123 closed 1 month ago

Bslabe123 commented 1 month ago

Note: the text_content=jetstream_pb2.DecodeRequest.TextContent(text=request.prompt) change fixes a Jetstream change that broke making requests to the endpoint, the issue around decoding responses still exists, more to do there.

liurupeng commented 1 month ago

/gcbrun