czczup / ViT-Adapter

[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
https://arxiv.org/abs/2205.08534
Apache License 2.0
1.27k stars 140 forks source link

hi, I want to use this framework for visual grounding tasks, can you give me some suggestions to address this? #138

Open RYHSmmc opened 1 year ago