Open vztu opened 1 year ago
Hi, thank you for your comment and for bringing up the excellent MaxViT work. I am impressed by the innovative ideas and the results presented in the paper. In the revised version of our paper, we will include MaxViT's results in our experimental comparisons.
Hi @AFeng-x, thanks for sharing the great SMT work!
I'd like to bring up another highly related hierarchical vision transformer that also deals with the scale problems: MaxViT: Multi-Axis Vision Transformer [ECCV 2022]. I'm wondering if you could also add our work to your comparison figure and tables? Thanks a lot!