issues
search
NVIDIA
/
Megatron-LM
Ongoing research training transformer models at scale
https://docs.nvidia.com/megatron-core/developer-guide/latest/user-guide/index.html#quick-start
Other
9.23k
stars
2.08k
forks
source link
draft: Bert context parallelism support
#874
Closed
JimmyZhang12
closed
1 week ago