I have puzzled about the sequence of merging;
http://ethereon.github.io/netscope/#/gist/db945b393d40bfa26006; this picture told me that I should do BM first and then merge the input with residual and finally execute the activation function ReLU. However, I found that the author do merging first and after that do the BN and ReLu in the code. So, I wanna know whether there is the differnce between this two kinds of ways for the architure?
I have puzzled about the sequence of merging; http://ethereon.github.io/netscope/#/gist/db945b393d40bfa26006; this picture told me that I should do BM first and then merge the input with residual and finally execute the activation function ReLU. However, I found that the author do merging first and after that do the BN and ReLu in the code. So, I wanna know whether there is the differnce between this two kinds of ways for the architure?