Open tobylyf opened 1 year ago
In iteration t (t>1), is the model fine-tuned based on C+T pre-trained model, or the "iteration t-1" model?
Hi, in iteration t (t>1), we fine-tune the model starting from the "iteration t-1" model, not from the C+T pre-trained model.
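To make the initialization rule concrete, here is a minimal sketch of which checkpoint would be loaded before each iteration. The function name and the checkpoint file names (`c+t.pth`, `iter_{t}.pth`) are hypothetical, not taken from the repository. Starting iteration 1 from C+T is an assumption; the reply only covers t>1.

```python
def init_checkpoint(t: int, pretrained: str = "c+t.pth") -> str:
    """Return the checkpoint to load before fine-tuning in iteration t.

    File names are illustrative placeholders, not the repository's real paths.
    """
    if t < 1:
        raise ValueError("iterations are numbered from 1")
    if t == 1:
        # Assumption: the first iteration starts from the C+T pre-trained weights.
        return pretrained
    # For t > 1, continue from the model produced by the previous iteration.
    return f"iter_{t - 1}.pth"


# Which checkpoint initializes each of the first three iterations.
schedule = [(t, init_checkpoint(t)) for t in range(1, 4)]
for t, ckpt in schedule:
    print(f"iteration {t}: initialize from {ckpt}")
```

In a real training script the returned path would be passed to whatever loading routine the codebase uses before fine-tuning on the data generated for that iteration.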
I use the C+T model to generate the dataset. Should I then load C+T as the initialization when training the model on the generated data?