microsoft / mttl

Building modular LMs with parameter-efficient fine-tuning.
MIT License
78 stars 7 forks source link

Moe training fixes #91

Closed oleksost closed 2 months ago

oleksost commented 2 months ago