Open terasakisatoshi opened 3 years ago
I'm reading the article for CSPNet: A New Backbone that can Enhance Learning Capability of CNN.
It is not quite clear Equation (6) shown the image below:
Ho did you derive the equation (6)? Also it's not clear the definition of g_i represents the gradient propagated to the i^th dense layer. Could you explain more detail?
g_i
Thank you.
I'm not sure this is right???
I think so and the gradient of Wi can be computed by: dLoss/dwi=gi(dxi-1/dwi)
I'm reading the article for CSPNet: A New Backbone that can Enhance Learning Capability of CNN.
It is not quite clear Equation (6) shown the image below:
Ho did you derive the equation (6)? Also it's not clear the definition of
g_i
represents the gradient propagated to the i^th dense layer. Could you explain more detail?Thank you.