MohsenZand / ObjectBox

(ECCV 22 Oral) ObjectBox: From Centers to Boxes for Anchor-Free Object Detection
GNU General Public License v3.0
131 stars 16 forks source link

Questions about S part of SDIOU #17

Closed 0phoff closed 1 year ago

0phoff commented 1 year ago

Hey, Thanks for your inspiring work, it is quite refreshing to see a new take on object detection.

I had a question about the formulation of the S part of your loss function. According to Fig 4 in your article, the non-overlapping area S is equal to the two blue boxes. However, looking at your formula S = (L* - L)² + (T* - T)² + (R* - R)² + (B* - B)², it seems to visually be rather equal to 4 red boxes below. It seems your formulation is based on the scaled balanced loss paper, but could you explain the reason for this squared distance in more details ?

non-overlapping-area

MohsenZand commented 1 year ago

Thank you! What you have shown is literally the same as below:

Untitled

S = [(L - L)² + (B - B)²]+ [(T - T)² + (R - R)²], where [(L - L)² + (B - B)²] and [(T - T)² + (R - R)²] denote the square of diagonal of red rectangles A and B, respectively. As similar to scale balanced loss paper, we use the squared Euclidean distance to measure the distance between different points. We however show the area of the rectangles in the figure to make it more intuitive.

0phoff commented 1 year ago

Ok, that's what I figured after reading the scale balanced loss paper, but thanks for clearing this out! :)

xiaobai111111223 commented 1 year ago

Can this loss function be used directly in FCOS?