xingyizhou / ExtremeNet

Bottom-up Object Detection by Grouping Extreme and Center Points
BSD 3-Clause "New" or "Revised" License
1.03k stars 172 forks source link

What about the test speed on ExtremeNet #41

Open jerrywgz opened 4 years ago

jerrywgz commented 4 years ago

Could you please provide the test speed info ?

AlexeyAB commented 4 years ago

@jxingyizhou Hi,

Greate work!

https://www.zpascal.net/cvpr2019/Zhou_Bottom-Up_Object_Detection_by_Grouping_Extreme_and_Center_Points_CVPR_2019_paper.pdf

We use flip augmentation for testing. In our main comparison, we use additional 5⇥ multi-scale (0.5, 0.75, 1, 1.25, 1.5) augmentation. Finally, Soft-NMS [1] filters all augmented detection results. Testing on one image takes 322ms (3.1FPS), with 168ms on network forwarding, 130ms on decoding and rest time on image pre- and post-processing (NMS). ... Edge aggregation Edge aggregation (Section 4.3) gives a decent AP improvement of 0.7%. It proofs more effective for larger objects, that are more likely to have a long axis aligned edges without a single well defined extreme point. Removing edge aggregation improves the decoding time to 76ms and overall speed to 4.1 FPS.

ExtremeNet (SS) Hourglass-104 511x511 - 40.2% AP - 55.5% AP50 - 20.5 FPS (V) ExtremeNet (MS) Hourglass-104 511x511 - 43.7% AP - 60.5% AP50 - 4.1 FPS (V)