logits bij not initialized to 0

@vijaykumar01 In the paper it wasn't entirely clear to me - yes, in the Procedure 1 they note the initialization to 0, but in paragraph 2 they write:

(...) initial logits bij are the log prior probabilities that capsule i should be coupled to capsule j.

The way I interpreted it is that the network can learn the good prior - initialization - which capsules should be capsuled with which. I thought it's more general and data-driven.

Later they released their tensorflow intiliazation and it seems that in the end initiali logits bij are indeed just set to 0 in their case. However, since making these values learnable parameters didn't break the training and (in my opinion) could potentially help, I kept them that way. It's a good remark though.

adambielski / CapsNet-pytorch

logits bij not initialized to 0 #8