WongKinYiu / CrossStagePartialNetworks

Cross Stage Partial Networks
https://github.com/WongKinYiu/CrossStagePartialNetworks
894 stars 172 forks source link

How do I derive an equations of weight updating ? #48

Open terasakisatoshi opened 3 years ago

terasakisatoshi commented 3 years ago

I'm reading the article for CSPNet: A New Backbone that can Enhance Learning Capability of CNN.

It is not quite clear Equation (6) shown the image below:

image

Ho did you derive the equation (6)? Also it's not clear the definition of g_i represents the gradient propagated to the i^th dense layer. Could you explain more detail?

Thank you.

terasakisatoshi commented 3 years ago

I'm not sure this is right???

image

xarryon commented 3 years ago

I think so and the gradient of Wi can be computed by: dLoss/dwi=gi(dxi-1/dwi)