Closed AlpacaManAlpha closed 1 day ago
For Q2, you can load the checkpoint and check the keys, especially train_step
When I reduce my checkpoints, will f5 use these for further training?
yes, but start from step=0, and all optimizer scheduler states reset
Does it need the .pt or will .safetensors also do?
just .pt is fine
Since I plan to train an assortment of speakers, storage will be an issue for me eventually.
you are using a mixed a dataset or train separate models? set a larger saving interval is fine
Is there a difference between model_last and the latest checkpoint?
the model_last
is the lastest while the ckpt like model_200000.pt
maybe not
it is quite straightforward, that last
is literally the last, 200000 is literally the ckpt for 200k step ckpt
Will close this issue, feel free to ask question.
Checks
Question details