google / jetstream-pytorch

PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"
Apache License 2.0
33 stars 14 forks source link

Add mlperf benchmark for offline for mixtral #153

Closed qihqi closed 1 month ago

qihqi commented 1 month ago

no server use 3 buckets of different cache length (512, 1280, 3072)

FanhaiLu1 commented 1 month ago

Can you fix the lint and unit test error?

qihqi commented 1 month ago

Can you fix the lint and unit test error?

I'll do it when we merge it to main