Closed YiguoHe closed 2 months ago
Hello! The pseudo-code of your fine-tuning process shows that 'the total loss =loss fine + alpha * loss_coarse'. Have you ever studied the effect of different values of this 'alpha' parameter? thank you! Best wishes!
We made some ablations on alpha. The result is similar for alpha=0.1 or alpha=0.2.
Thank you very much!
Hello! The pseudo-code of your fine-tuning process shows that 'the total loss =loss fine + alpha * loss_coarse'. Have you ever studied the effect of different values of this 'alpha' parameter? thank you! Best wishes!