google / jetstream-pytorch

PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"
Apache License 2.0
21 stars 12 forks source link

fix mixtral quantization scaler axis when dimension > 2 #132

Closed sixiang-google closed 4 weeks ago