Closed Caozhou1995 closed 2 months ago
This PR is to be closed because in the future we will focus on megatron's dist ckpt functionality and make related optimizations based on it. At the same time, we used DLRover as backup, see PR for details: https://github.com/FlagOpen/FlagScale/pull/155
This PR adds FlagScale's Flash Checkpoint feature. [TBD]