allanj closed 8 months ago
The plan is to keep the codebase as minimal as possible, with a more explicit and accessible design for users, and to achieve performance at least on par with, or faster than, Megatron-DeepSpeed.