Open youngwanLEE opened 7 years ago
@youngwanLEE I found your WR-Inception+SSD. Does it better than VGG16 in human detection? Thanks.
@KeyKy yes, of course, various strategies for KITTI are needed.
@youngwanLEE Sorry to bother you again. Where did you get the ImageNet pretrained model of WR-Inception?
@KeyKy I directly trained pretrained model.
@Keyky can you share DSSD code? Thanks
@roipony I only talk about SSD here.
@youngwanLEE can you share DSSD code? Thanks
@youngwanLEE Do you implement ResNet+SSD successfully?
can you please explain to me what are the meaning of min_ratio and step and aspect_ratios
Hi, Wei.
I have implemented your DSSD on VOC and ILSVRC16.
I implemented PM(Prediction Module) + DM(Deconvolution Module) based on SSD-VGG-512 model and achieve mAP 80.87 on VOC07test (trained on 07+12 from scratch).
However, when implementing the original ResNet101-based SSD, the accuracy is 77 mAP.
My settings are belows :
for Res101-SSD-321
for Res101-SSD-512
I tried to experiment many trials. But the result is as not good as your results on your paper.
I guesses two things that 1) the default box scale setting. 2) the effect of batch size on one device.
In your DSSD paper's page 7, you mentioned that
"According to our observation, a batch size smaller than 16 and trained on 4 GPUs can cause unstable results in batch normalization and hurt accuracy."
This means that if I use a batch size smaller than 4 on one device, accuracy will get hurt due to unstable batch normalization?
My question are :
1) How to set default box scale for ResNet101 for DSSD ? -- are bigger than those of VGG or smaller ? 2) The effect of batch size on a device.
thanks in advance.