kyegomez / Mixture-of-Depths

Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
MIT License
61 stars 5 forks source link

Is there an official implementation? #39

Open BrownTan opened 1 week ago

BrownTan commented 1 week ago

Is there an official implementation?

Upvote & Fund

Fund with Polar

github-actions[bot] commented 1 week ago

Hello there, thank you for opening an Issue ! 🙏🏻 The team was notified and they will get back to you asap.