google / JetStream

JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in the future -- PRs welcome).
Apache License 2.0
194 stars, 24 forks

Change the default message for requester.py and remove mlperf 4.1 install for proxy version support. #136

Closed zhihaoshan-google closed 1 week ago

zhihaoshan-google commented 1 week ago

Use a more positive default message for requester.py.