ResNet50 stride is greater than filter size

keras-team / keras-applications

Reference implementations of popular deep learning models.

Other

2k stars 910 forks source link

On lines 240, 245, and 252 of the ResNet50 implementation the default value of (2, 2) for stride is used. In the conv_block on lines 114 and 131 (for both the main path and the shortcut) a filter of size (1, 1) is used with the (2, 2) stride. However, wouldn't that ignore half of the values because the filter is smaller than the stride? I propose that the conv_block should use zero_padding of 1 and then a (3,3) filter instead. This way all the sizes are kept the same and no information is lost with either a stride of (1,1) or (2,2).

keras-team / keras-applications

ResNet50 stride is greater than filter size #81