lartpang / MINet

CVPR2020, Multi-scale Interactive Network for Salient Object Detection
https://openaccess.thecvf.com/content_CVPR_2020/html/Pang_Multi-Scale_Interactive_Network_for_Salient_Object_Detection_CVPR_2020_paper.html
MIT License
237 stars 29 forks source link

About CEL Loss #16

Closed LYJ903118120 closed 3 years ago

LYJ903118120 commented 3 years ago

Hello, may CEL loss be Dice loss, right?

lartpang commented 3 years ago

Thank you for your attention. Actually, it's a bit of a coincidence. In terms of the final form, cel can indeed be transformed into a form similar to dice loss.

But the difference is that when we build this loss, we derive it intuitively according to the needs of the task itself and the proportional relationship between regions.

So, we can easily take the form of CEL, and the picture here shows the corresponding conception process of CEL.

image ==>> image

In addition, we analyze the effectiveness of cel, that is, the form of its gradient is more effective for this task than cross-entropy.

image

In Equ. (8), except that the numerator term 1 − 2g is position-specific, the other terms are image-specific. And this numerator is closely related to the binary ground truth, which results in that the inter-class derivatives have large differences while the intra-class ones are relatively consistent. This has several merits: 1) It ensures that there is enough large gradient to drive the network in the later stage of training; 2) It helps solve the intra-class inconsistency and inter-class indistinction issues, to some extent, thereby promoting the predicted boundaries of salient objects to become sharper.