Closed martinferianc closed 1 year ago
@martinferianc you're right, pytorch supports any number of dimensions that can come before the feature dimension. I'll change to computations accordingly. Initially that hook supposes 1d case, which was massively used in old-fashioned CNNs
Understood and thank you!
Hey! Thanks for amazing work. I want to ask, is the FLOPs counting method for 3D input for a linear layer correct?
Right now it is:
The bias is counted only once for a 3D input, if I am not mistaken bias is added to each element of the output, so shouldn't it be: