Closed isalirezag closed 6 years ago
I don not understand how the scale is used in L2Norm function I appreciate it if someone can explain it for me please
This is proposed by the original ssd paper. This is used because of the different scale between different layers in the VGG. In the other networks, we prefer to use the regular batch norm rather than L2 norm.
I don not understand how the scale is used in L2Norm function I appreciate it if someone can explain it for me please