tianzhi0549 / FCOS

FCOS: Fully Convolutional One-Stage Object Detection (ICCV'19)
https://arxiv.org/abs/1904.01355
Other
3.28k stars 630 forks source link

Exp(s_i,x) with a trainable scalar s_i #243

Closed ajoseph12 closed 4 years ago

ajoseph12 commented 4 years ago

Hey, Thank you for the lovely work and provoking more research in the direction of Anchor Free Detectors. Having read the paper and gone through the code I had a specific question concerning the normalization of regression outputs in view of the recent ameliorations realized to FCOS. From your code, it's clear that:

However, in the paper/thesis, you have clearly mentioned that during training the predictions are normalized by a scalar 's' using the equation exp(s,x), which equates to s^x, where s is the base and x the exponent. Additionally, it's mentioned that this scalar ought to be learnable for tailored regression towards targets at different pyramid levels (P3 to P7).

My question is, is dividing the regression targets (by the strides) during training and then multiplying regression predictions (by the strides) during testing tantamount to the training methodology described in the paper/thesis - rendering 's' trainable?

tianzhi0549 commented 4 years ago

@ajoseph12 all the methods yield similar performance, thus you should not focus on this point. We recommend using the one in the improved version.

ajoseph12 commented 4 years ago

Perfect! thanks