AI-Hypercomputer / JetStream

JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in the future -- PRs welcome).
Apache License 2.0

Format token utils and test #51

Closed. FanhaiLu1 closed this 5 months ago

FanhaiLu1 commented 5 months ago

Format token utils and test