JierunChen / FasterNet

[CVPR 2023] Code for PConv and FasterNet
691 stars 55 forks source link

Some problems in section 3.1 #42

Open juan19941228 opened 1 year ago

juan19941228 commented 1 year ago

The equation (2) for calculating memory access does not include the pointwise convolution.

JierunChen commented 1 year ago

@juan19941228 Yes, equation (2) only compares the memory access for spatial feature extraction and does not account for the memory access by the pointwise convolution (PWConv).

abcsimple commented 1 year ago

Is formula 2 only applicable to MobileNet v2, or can it be used with other models as well? This is an important question because in some cases, such as when DWConv is implemented before Conv 11, the memory access can be calculated using the formula h w 2c + k^2 c, which results in lower memory usage compared to a regular Conv.

JierunChen commented 1 year ago

@abcsimple Hi, the comparison of equations (2) and (3) is based on a reasonable assumption that the width or the number of channels when using DWConv is generally higher than that when using Conv.

abcsimple commented 1 year ago

@JierunChen Thank you for your response. The number of channels in DWConv is increased by the Pointwise Convolution. If DWConv is implemented before Pointwise Convolution, the channel should be the same as the input feature map. Please let me know if I have misunderstood anything.