In that multi_task_loss paper (2018CVPR), author calculated classification loss through cross_entropy(CE) scaled by sigma^2 then plus log(sigma). But in your code, it seems to be calculated through CE scaled by sigma^4. I wanna know whether It's a mistake or a trick. Thanks.
In the code of the multi-task loss function, I would like to know why the classification loss is calculated in that way.
In that multi_task_loss paper (2018CVPR), author calculated classification loss through cross_entropy(CE) scaled by sigma^2 then plus log(sigma). But in your code, it seems to be calculated through CE scaled by sigma^4. I wanna know whether It's a mistake or a trick. Thanks.