Open Usernamezhx opened 7 years ago
Hi, @Usernamezhx ,
I think you didn't get the theta right. You should not initialize the weight to all zeros for that layer. Also, you should set the bias constant to make sure the initial thetas form identity matrix. You can always visualize what you got from the transformation after the st layer.
Bests, Kaichun
@daerduoCarey
why do we need to set initial thetas to identity matrix? what is the math intuition behind that?
hi daerduoCarey First of all. thanks for sharing your code . I add the stn layer after the data layer:
but when i train for about 5w iteration .it turn out that :the loss and the accuracy is nearly constant:
caffe.txt