IBM / dolomite-engine

Dolomite Engine is a library for pretraining/finetuning LLMs
Apache License 2.0
23 stars 7 forks source link

Load balancing loss #6

Closed shawntan closed 2 months ago

shawntan commented 2 months ago

Load balancing loss computation for MoE