Open fmiao2372 opened 2 months ago
When will Megatron-DeepSpeed support llama3/llama3.1 pretraining?
llama3 and llama3.1 are similar to llama2, so you don't need to change your code.