Closed skaae closed 8 years ago
Nice!
Have you tried this for some generative task? From skimming through the paper it seems it only compares WN w/ Polyak averaging (and norm bounding?) with no normalization at all. It would be nice to also compare to BN and WN + mean-only BN, and maybe without the averaging.
preliminary implementation of http://arxiv.org/abs/1602.07868
This is an implementation of equation 1+2 in the paper. Please check if that I didn't take norm over the wrong axis :)
Should we support mean--only batch normalization? In that case i think the easiest is to create mean-only batch norm layer and wrap the weight normalization layer.
Maybe we should provide a helper function for mean-only bn + WN