bigcode-project / Megatron-LM

Ongoing research training transformer models at scale
Other
373 stars 48 forks source link

replace repeat_interleave with basic torch functions #79

Closed mayank31398 closed 1 year ago

mayank31398 commented 1 year ago

related: https://github.com/NVIDIA/Megatron-LM/issues/543