kamanphoebe / Look-into-MoEs

A Closer Look into Mixture-of-Experts in Large Language Models
https://arxiv.org/abs/2406.18219
MIT License
37 stars 0 forks source link