microsoft / tutel

Tutel MoE: An Optimized Mixture-of-Experts Implementation
MIT License
710 stars 85 forks source link

move `fp32_gate` checking from moe_layer to top #150

Closed ghostplant closed 2 years ago