Open drcanchi opened 1 month ago
By default, Megatron-DeepSpeed disables async_tensor_model_parallel_allreduce. Is there any plan to enable this feature ?
https://github.com/microsoft/Megatron-DeepSpeed/blob/main/megatron/arguments.py#L398
By default, Megatron-DeepSpeed disables async_tensor_model_parallel_allreduce. Is there any plan to enable this feature ?
https://github.com/microsoft/Megatron-DeepSpeed/blob/main/megatron/arguments.py#L398