Open cbalioglu opened 9 months ago
Introduce Megatron-style model parallelism.
Hi @cbalioglu, when will this feature be released? Also, can I use DeepSpeed to train the seamless communication model?