microsoft / tutel

Tutel MoE: An Optimized Mixture-of-Experts Implementation
MIT License
723 stars, 93 forks

keep output dims fixed for fast_encode/fast_decode #117

Closed by ghostplant 2 years ago