google / maxtext

A simple, performant and scalable Jax LLM!
Apache License 2.0
1.39k stars, 247 forks

Enable quantization for MoE Gating #757

Closed: RissyRan closed this 1 week ago

RissyRan commented 2 weeks ago

Description

Test

Adding quantization=int8 flag: