I am confused about why softmax is applied to v_length here, since I could not find this operation in Hinton's paper or in its Figure 1.
On the contrary, CapsNet seems to allow multi-label classification, which means applying softmax to v_length should not be necessary, according to Section 3 of Hinton's paper ("To allow for multiple digits, we use a separate margin").
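
To make the concern concrete, here is a minimal sketch of the per-class margin loss as I understand it from Section 3 of the paper, with m+ = 0.9, m- = 0.1 and lambda = 0.5. The function name `margin_loss` and the example lengths are my own illustrative assumptions, not code from this repository.

```python
import math


def margin_loss(v_length, targets, m_plus=0.9, m_minus=0.1, lam=0.5):
    """Sum over classes of the margin loss L_k for a single example.

    v_length: capsule output lengths ||v_k||, one per class
    targets:  T_k, 1 if class k is present, else 0 (several may be 1)
    """
    total = 0.0
    for length, t in zip(v_length, targets):
        present = t * max(0.0, m_plus - length) ** 2
        absent = lam * (1 - t) * max(0.0, length - m_minus) ** 2
        total += present + absent
    return total


# Hypothetical example: two digits present at the same time.
v_length = [0.95, 0.92, 0.05]
targets = [1, 1, 0]

# Raw lengths: both present classes exceed m_plus, so the loss vanishes.
loss_raw = margin_loss(v_length, targets)

# After softmax the lengths sum to 1, so two classes cannot both reach 0.9.
exps = [math.exp(x) for x in v_length]
softmaxed = [e / sum(exps) for e in exps]
loss_softmax = margin_loss(softmaxed, targets)
```

With the raw lengths, each class is judged independently, so two capsules can be long at once. After softmax the values compete for a total of 1, which is why I think it conflicts with the multi-digit setting.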