ViTAE-Transformer / ViTAE-VSA

The official repo for [ECCV'22] "VSA: Learning Varied-Size Window Attention in Vision Transformers"
https://arxiv.org/abs/2204.08446
157 stars 9 forks source link

pre-trained model of Swin + VSA #6

Open lyu124 opened 1 year ago

lyu124 commented 1 year ago

hello, i am now doing a work about semantic segmentation and i would like to use swin with vsa model as the feature extractor backbone. I would like to ask, about when will you maybe upload the pre-trained model? Without pretrained model the results are really with low accuracy now... Thank you so much.