Closed maoyuanpeng closed 3 years ago
Why do we apply L2 regularization to each of these convolution layers and use "he_normal" to initialize the convolution kernel parameters?
Does this help training?
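
For context, here is a minimal sketch of what the two settings compute, assuming the standard Keras definitions: "he_normal" draws kernel weights from a zero-centered normal distribution with stddev = sqrt(2 / fan_in), and an L2 regularizer adds factor * sum(w^2) to the loss. The layer shape (3x3 kernel, 64 input channels, 64 filters) and the factor 1e-4 are hypothetical values chosen for illustration, not taken from this repository. The sketch samples from a plain normal distribution, whereas Keras uses a truncated one.

```python
import math
import random

def he_normal_std(kernel_h, kernel_w, in_channels):
    """Standard deviation used by He-normal init: sqrt(2 / fan_in)."""
    fan_in = kernel_h * kernel_w * in_channels
    return math.sqrt(2.0 / fan_in)

def l2_penalty(weights, factor):
    """L2 term added to the training loss: factor * sum(w^2)."""
    return factor * sum(w * w for w in weights)

# Hypothetical layer: 3x3 kernel, 64 input channels, 64 filters.
rng = random.Random(0)
std = he_normal_std(3, 3, 64)
# Plain Gaussian samples (Keras truncates the distribution; this sketch does not).
weights = [rng.gauss(0.0, std) for _ in range(3 * 3 * 64 * 64)]
penalty = l2_penalty(weights, factor=1e-4)  # factor is an assumed value

print(f"std={std:.4f}, l2 penalty={penalty:.6f}")
```

Scaling the initial weights by fan_in is meant to keep activation variance roughly stable across ReLU layers, which tends to make deep networks easier to train. The L2 term penalizes large weights, which usually reduces overfitting. Whether either setting helps in a given case still has to be checked on validation data.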