BB88Lee closed this issue 4 years ago
@BB88Lee This is widely discussed in many papers, such as AlignDet [1], Guided Anchor [2], and DAFS [3]. In my view, the difference lies in whether NMS (which is the most time-consuming part) is applied in the proposal generation module. Furthermore, RoI pooling (which requires pixel binning) has more complexity than feature warping/sampling/adaptation (e.g. applying deconv to adapt the feature map based on the guided anchor). The boundary between two-stage and one-stage detectors is becoming more and more ambiguous.
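For readers unfamiliar with the NMS step mentioned above, here is a minimal pure-Python sketch of greedy non-maximum suppression. It is only an illustration, not the code used in this repository or in any of the cited papers; the function names `iou` and `nms`, the `(x1, y1, x2, y2)` box layout, and the 0.5 threshold are all assumptions made for the example.

```python
def iou(a, b):
    """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes, scores, iou_threshold=0.5):
    """Greedy NMS: return indices of kept boxes, highest score first.

    iou_threshold is an assumed default chosen for illustration.
    """
    # Visit candidates from highest to lowest score.
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        # Keep a box only if it does not overlap too much with any kept box.
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
kept = nms(boxes, scores)
```

In this example the second box overlaps the first heavily and is suppressed, while the third box is kept. In the worst case the number of pairwise IoU checks grows quadratically with the number of candidate boxes, which is part of why this step can dominate proposal generation time.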
Thanks for the nice work!
I have some doubts about the naming of the method. In the code, I saw some operations similar to those of a two-stage network.
Difference from a normal two-stage network:
Could you explain your idea about "single stage"?