AFeng-x / SMT

This is an official implementation for "Scale-Aware Modulation Meet Transformer".
https://arxiv.org/abs/2307.08579
MIT License
185 stars 15 forks source link

Missing important literature #4

Open vztu opened 1 year ago

vztu commented 1 year ago

Hi @AFeng-x, thanks for sharing the great SMT work!

I'd like to bring up another highly related hierarchical vision transformer that also deals with the scale problems: MaxViT: Multi-Axis Vision Transformer [ECCV 2022]. I'm wondering if you could also add our work to your comparison figure and tables? Thanks a lot!

AFeng-x commented 1 year ago

Hi, Thank you for your comment and for bringing up the excellent MaxViT work. I am impressed by the innovative ideas and the results presented in the paper. In the revised version of our paper, we will include your work's results in our experimental findings.