Closed LYJ903118120 closed 3 years ago
Thank you for your attention. Actually, it's a bit of a coincidence. In terms of the final form, cel can indeed be transformed into a form similar to dice loss.
But the difference is that when we build this loss, we derive it intuitively according to the needs of the task itself and the proportional relationship between regions.
So, we can easily take the form of CEL, and the picture here shows the corresponding conception process of CEL.
==>>
In addition, we analyze the effectiveness of cel, that is, the form of its gradient is more effective for this task than cross-entropy.
In Equ. (8), except that the numerator term 1 − 2g is position-specific, the other terms are image-specific. And this numerator is closely related to the binary ground truth, which results in that the inter-class derivatives have large differences while the intra-class ones are relatively consistent. This has several merits: 1) It ensures that there is enough large gradient to drive the network in the later stage of training; 2) It helps solve the intra-class inconsistency and inter-class indistinction issues, to some extent, thereby promoting the predicted boundaries of salient objects to become sharper.
Hello, may CEL loss be Dice loss, right?