Effect of bias in linear layers

bioinf-jku / SNNs

Tutorials and implementations for "Self-normalizing networks"

GNU General Public License v3.0

1.58k stars 199 forks source link

I've been experimenting with SELUs, and found they provide an improvement in terms of computation time during training with respect to batchnorm, thank you for your work.

I just have a question regarding the effect of bias in linear layers. As I understand it, every neuron should have mean zero in order to stay in the self regularizing zone, but bias precisely shifts that mean. In my experiments however I didn't see much of an effect either removing or adding biases. I see that in the tutorial notebook bias is used, and I wonder wether you've considered the issue.

bioinf-jku / SNNs

Effect of bias in linear layers #16