Thanks for your excellent eork!I have a problem with finetuning the model("first stage.pt").When I train with the vanilla SVD model,I can get the following 2 model weights."mp_rank_00_model_states.pt" has the same size as "first stage.pt",but I cannot get the "zero_pp_rank_0_mp_rank_00_optim_states.pt" matching the "first stage.pt".
Therefore,if I want to finetune the model with more datasets,I replace "first stage.pt" with "mp_rank_00_model_states.pt",but how to get the "zero_pp_rank_0_mp_rank_00_optim_states.pt"?
Thanks for your excellent eork!I have a problem with finetuning the model("first stage.pt").When I train with the vanilla SVD model,I can get the following 2 model weights."mp_rank_00_model_states.pt" has the same size as "first stage.pt",but I cannot get the "zero_pp_rank_0_mp_rank_00_optim_states.pt" matching the "first stage.pt". Therefore,if I want to finetune the model with more datasets,I replace "first stage.pt" with "mp_rank_00_model_states.pt",but how to get the "zero_pp_rank_0_mp_rank_00_optim_states.pt"?