bigcode-project / Megatron-LM

Ongoing research training transformer models at scale

replace repeat_interleave with basic torch functions #79

Status: Closed (closed by mayank31398 10 months ago)

mayank31398 commented 10 months ago

related: https://github.com/NVIDIA/Megatron-LM/issues/543
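
For readers without the diff at hand, here is a minimal sketch of what replacing `repeat_interleave` with basic torch functions can look like. The thread does not show the actual change, so this is only one possible approach: the helper name `repeat_rows` is hypothetical, and the sketch assumes a scalar repeat count applied along dim 0.

```python
import torch


def repeat_rows(x: torch.Tensor, repeats: int) -> torch.Tensor:
    """Hypothetical helper: repeat each slice along dim 0 `repeats` times.

    Intended to match x.repeat_interleave(repeats, dim=0) for an integer
    `repeats`, using only unsqueeze, expand, and reshape.
    """
    # Add a new axis right after dim 0: (n, ...) -> (n, 1, ...)
    # then broadcast that axis to size `repeats` without copying data.
    expanded = x.unsqueeze(1).expand(x.shape[0], repeats, *x.shape[1:])
    # Merge the first two axes: (n, repeats, ...) -> (n * repeats, ...).
    # reshape copies here because the expanded view is not contiguous.
    return expanded.reshape(x.shape[0] * repeats, *x.shape[1:])


x = torch.arange(6).reshape(3, 2)
out = repeat_rows(x, 2)

# Sanity check against the built-in op being replaced.
assert torch.equal(out, x.repeat_interleave(2, dim=0))
```

This sketch does not cover the per-element form of `repeat_interleave`, where `repeats` is a tensor.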