sanghoon / pva-faster-rcnn

Demo code for PVANet
https://arxiv.org/abs/1611.08588
Other
651 stars 241 forks source link

Scale and BN on training #88

Open dereyly opened 7 years ago

dereyly commented 7 years ago

Hello Why we cant merge (gen_merged_model.py) all batchNorm on training -- all BN are constant batch_norm_param {use_global_stats: true} and some Scale on first 2 blocks (until conv3 block)

Additional questions: -- Training are better with lr_mult: 0.1 on feature extraction? -- Why BN on FC layers constant too ( use_global_stats: true ) maybe its better to adapt batch (256) with BN?