Closed madoka109 closed 2 years ago
Thank you for your comment. Here we just follow the standard implementation of CrossEntropy loss in Pytorch, please refer to the code directly,which means using the prob or the logits is nothing but a matter of expression. As for not adding the additional division of 2 for L_rec, well, you can see it is just a matter of scale, isn't it?
And that's why your L_rec doesn't need to be divided by 2?