ydhongHIT closed this issue 2 years ago
Hi @ydhongHIT , thanks for your attention. The goal of our paper is to demonstrate that the competence of transformer models primarily stems from the general MetaFormer architecture. To achieve this, we selected pooling, the simplest token mixer, to demonstrate MetaFormer. A token mixer comparison between pooling and depthwise convolution does not quite fit that goal. We plan to release more MetaFormer models with different token mixers (e.g., depthwise convolution) around March. More experimental results may be added to a future revised version of this paper or reported in a new tech report.
Thanks for your great work. Are there any experiments comparing 3x3 average pooling with 3x3 depthwise convolution, for example, directly replacing pooling with a 3x3 depthwise convolution in the same architecture?
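For readers following the thread, a minimal sketch of the comparison being asked about: the pooling token mixer as described in the PoolFormer paper (average pooling with the identity subtracted, since the block has a residual connection), next to a 3x3 depthwise convolution that could be dropped into the same slot. The `DWConvMixer` class name and interface here are assumptions for illustration, not code from the authors' repository.

```python
import torch
import torch.nn as nn


class Pooling(nn.Module):
    """Pooling token mixer (form described in the PoolFormer paper):
    average pooling with the input subtracted, because the surrounding
    block already adds a residual connection."""

    def __init__(self, pool_size=3):
        super().__init__()
        self.pool = nn.AvgPool2d(pool_size, stride=1,
                                 padding=pool_size // 2,
                                 count_include_pad=False)

    def forward(self, x):
        return self.pool(x) - x


class DWConvMixer(nn.Module):
    """Hypothetical drop-in replacement: a 3x3 depthwise convolution
    (groups=dim makes each channel convolve only with itself)."""

    def __init__(self, dim, kernel_size=3):
        super().__init__()
        self.conv = nn.Conv2d(dim, dim, kernel_size,
                              stride=1, padding=kernel_size // 2,
                              groups=dim)

    def forward(self, x):
        return self.conv(x)


# Both mixers preserve the (batch, channels, height, width) shape,
# so one can substitute for the other inside the same MetaFormer block.
x = torch.randn(2, 64, 14, 14)
print(Pooling()(x).shape)        # torch.Size([2, 64, 14, 14])
print(DWConvMixer(64)(x).shape)  # torch.Size([2, 64, 14, 14])
```

Note the trade-off the comparison probes: pooling has no learnable parameters, while the depthwise convolution adds `dim * kernel_size**2` weights per mixer.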