kyegomez / Mixture-of-Depths

Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
MIT License
55 stars 2 forks source link

Example code #13

Open drdsgvo opened 3 months ago

drdsgvo commented 3 months ago

There is a file example.py, but it is empty. Will an illustrative example code be published?

Upvote & Fund

Fund with Polar

github-actions[bot] commented 3 months ago

Hello there, thank you for opening an Issue ! 🙏🏻 The team was notified and they will get back to you asap.