ChristofferM closed this issue 5 years ago
The paper doesn't mention this. The bias term is motivated by the fact that a capsule is essentially an extension of a neuron in two respects: 1. the coupling assignment c is separated from the weight w; 2. x_i is represented as a matrix instead of a scalar.
The coupling assignment c is learned by the routing algorithm, and w by back-propagation. Although the paper doesn't mention it, a bias term for the intercept seems reasonable, just as in an ordinary neuron. For a more intuitive comparison of capsules and neurons, see the picture in the README. We will soon publish our work discussing the relation between capsules and neurons in more detail; you are welcome to follow it.
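To make the split between c and the bias concrete, here is a rough NumPy sketch of routing-by-agreement with an added bias on s_j. It is only an illustration, not the code from this repo: the function names, the bias shape `(num_out, dim)`, and the toy sizes are assumptions. The routing logits b_ij start at zero and are updated by agreement inside the loop, while the bias (like w) would be a parameter trained by back-propagation.

```python
import numpy as np

def squash(s, axis=-1, eps=1e-9):
    # Shrinks each vector to a length in [0, 1) while keeping its direction.
    sq_norm = np.sum(s * s, axis=axis, keepdims=True)
    return (sq_norm / (1.0 + sq_norm)) * s / np.sqrt(sq_norm + eps)

def softmax(x, axis):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def route(u_hat, bias, num_iters=3):
    """u_hat: (num_in, num_out, dim) prediction vectors.
    bias: (num_out, dim), an assumed shape for the extra intercept on s_j."""
    num_in, num_out, _ = u_hat.shape
    b = np.zeros((num_in, num_out))  # routing logits, reset for every input
    for _ in range(num_iters):
        c = softmax(b, axis=1)  # coupling coefficients over output capsules
        # Weighted sum of predictions plus the bias term (not in the paper).
        s = np.einsum("ij,ijd->jd", c, u_hat) + bias
        v = squash(s)
        # Agreement update: dot product of each prediction with each output.
        b = b + np.einsum("ijd,jd->ij", u_hat, v)
    return v, c

rng = np.random.default_rng(0)
u_hat = rng.normal(size=(6, 3, 4))  # toy sizes: 6 input capsules, 3 outputs
bias = np.zeros((3, 4))             # would be a trainable variable in practice
v, c = route(u_hat, bias)
```

In this sketch only `bias` (and the weights that produce `u_hat`) would receive gradients; `b` and `c` are recomputed from scratch on every forward pass.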
Thanks! Looking forward to it.
In 5b4eb1a, biases for s_j were added. How is this motivated? It is not in the NIPS paper. Also, shouldn't B_IJ be learned?