Closed ZhimingZhou closed 7 years ago
Hi, it seems we can apply batch normalization and, at the same time, re-parameterize the weights for better, more robust optimization. Would batch normalization still help once the weights are re-parameterized?
Do you have some experiments on this?
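To make the question concrete, here is a minimal sketch of the combination I have in mind. It assumes the re-parameterization is weight normalization, w = g * v / ||v||, and uses a plain batch normalization without learned scale or shift. The function names `weight_norm_preactivation` and `batch_norm` are made up for illustration and do not come from any library.

```python
import math
import random


def weight_norm_preactivation(x, v, g):
    """Pre-activation of one unit with the assumed re-parameterization w = g * v / ||v||."""
    norm_v = math.sqrt(sum(vi * vi for vi in v))
    w = [g * vi / norm_v for vi in v]
    return sum(xi * wi for xi, wi in zip(x, w))


def batch_norm(values, eps=1e-5):
    """Normalize one feature over the batch (no learned scale or shift)."""
    mean = sum(values) / len(values)
    var = sum((t - mean) ** 2 for t in values) / len(values)
    return [(t - mean) / math.sqrt(var + eps) for t in values]


random.seed(0)
# Toy batch: 64 examples with 8 input features each (illustrative sizes).
batch = [[random.gauss(0.0, 1.0) for _ in range(8)] for _ in range(64)]
v = [random.gauss(0.0, 1.0) for _ in range(8)]

# Same direction v, two different gains g.
out_a = batch_norm([weight_norm_preactivation(x, v, 1.0) for x in batch])
out_b = batch_norm([weight_norm_preactivation(x, v, 3.0) for x in batch])

# Largest difference between the two normalized outputs.
max_diff = max(abs(a - b) for a, b in zip(out_a, out_b))
print(max_diff)
```

In this toy setup the batch-normalized output barely changes when g changes, apart from the small effect of `eps`, so batch normalization already removes the scale that g controls. That only shows the two normalizations overlap in the forward pass; it does not tell us whether combining them helps optimization, which is what I am asking about.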