Closed narrowsnap closed 4 years ago
Thank you for sharing. The reason why I implement this is only for understanding more about paper. If I can't reproduce performance same as write in the paper, which means that there is still missing to understand fully.
I know this repository become more getting interest but still busy these days. Anyway, thank you for the sharing and I will figure out the reason to reproduce this someday!
Hi, I read your code which is cool, and run it. But the APN model which implement from the Paper is not concise and confused me.
I spend much time on thinking about it. I suddenly see the light when I read The Spatial Transformer Networks.
All the crop operate can description with a 2x3 matrix([[0.5, 0.0, 0.0], [0.0, 0.5, 0.0]]).
M00 and M11 mean the size of crop area, M02 and M12 mean the coordinate of crop area.
Which means we only need to train 2 parameter(M02 and M12) to find the attention area.
And The Spatial Transformer Networks can autograd which we don't need to write the complex crop function.
I'm not finish trained The Spatial Transformer Networks when I write this text. I just want to share the stn net with you. That's wonderful!
Hope this can help you!