Closed xinyu-intel closed 5 months ago
minor fix in ParallelMLP
apply https://github.com/microsoft/Megatron-DeepSpeed/pull/104 to more test cases
cc @polisettyvarma
minor fix in ParallelMLP
apply https://github.com/microsoft/Megatron-DeepSpeed/pull/104 to more test cases