AI on GKE is a collection of examples, best practices, and prebuilt solutions to help build, deploy, and scale AI platforms on Google Kubernetes Engine.
Apache License 2.0
Bump Jetstream, maxtext, jetstream-pytorch versions in Jetstream inference server guide #695
Note: the text_content=jetstream_pb2.DecodeRequest.TextContent(text=request.prompt) change accommodates a Jetstream change that broke requests to the endpoint. The issue with decoding responses still exists, and more work is needed there.
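To illustrate the shape of the request change described in the note, here is a minimal sketch. It does not import Jetstream: TextContent and DecodeRequest below are hypothetical stand-in dataclasses that mirror only the nesting named in the note, and the max_tokens field and the build_decode_request helper are assumptions added for illustration. In the actual guide the classes would come from the generated jetstream_pb2 module.

```python
from dataclasses import dataclass
from typing import Optional


# Hypothetical stand-ins mirroring only the shape named in the note;
# the real message classes live in the generated jetstream_pb2 module.
@dataclass
class TextContent:
    text: str


@dataclass
class DecodeRequest:
    max_tokens: int  # assumed field, included only for illustration
    text_content: Optional[TextContent] = None


def build_decode_request(prompt: str, max_tokens: int) -> DecodeRequest:
    """Wrap the raw prompt string in a TextContent message."""
    return DecodeRequest(
        max_tokens=max_tokens,
        text_content=TextContent(text=prompt),
    )


request = build_decode_request("What is GKE?", max_tokens=32)
print(request.text_content.text)
```

The point of the sketch is that the prompt is no longer passed as a bare string: it is wrapped in a nested TextContent message and assigned to text_content.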