Closed Minami-su closed 9 months ago
ref_model=None means: This option is set when the model you want to train is the same as the model you froze.
This has the effect of reducing GPU allocation.
So, I think, ref=None model option is not the cause..
ref_model=None means: This option is set when the model you want to train is the same as the model you froze.
This has the effect of reducing GPU allocation.
So, I think, ref=None model option is not the cause..
Okay, maybe there's something wrong with the code in the qwen model
Let me know if you have any other information to share!
I trained my model, set the reference model to None, and observed no changes in the trained model compared to the original model during inference.