issues
search
IBM
/
dolomite-engine
Dolomite Engine is a library for pretraining/finetuning LLMs
Apache License 2.0
23
stars
7
forks
source link
Load balancing loss
#6
Closed
shawntan
closed
2 months ago
shawntan
commented
2 months ago
Load balancing loss computation for MoE
Load balancing loss computation for MoE