Closed vince62s closed 6 months ago
I did not try to reproduce the training process but I have a question on the paper and the code.
when you add up cpo_loss and clm_loss what are the orders? I mean are both component about the same value is one really smaller than the other one ?
thanks.
It depends on the tasks. But for machine translation, they are in the similar order.
I did not try to reproduce the training process but I have a question on the paper and the code.
when you add up cpo_loss and clm_loss what are the orders? I mean are both component about the same value is one really smaller than the other one ?
thanks.