Open HongyuZhu999 opened 5 months ago
I'm curious as to why down sampling is used, because it will reduce data infomation. Would it be better if I didn't use it?
It's a cute question. I just inherit the code from Swin-Transformer and using the architecture.
You can try the architecture of plian ViT to check whether VMamba works under that structure.
I'm curious as to why down sampling is used, because it will reduce data infomation. Would it be better if I didn't use it?