bigcode-project / Megatron-LM

Ongoing research training transformer models at scale
Other
374 stars 49 forks

Add tokens-per-second-gpu to the printed logs instead of just wandb #54

Closed loubnabnl closed 1 year ago