Wanted to know if there is an MAE based implementation for Vit Adapter ?

czczup / ViT-Adapter

[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions

https://arxiv.org/abs/2205.08534

Apache License 2.0

1.27k stars 140 forks source link

Open ayushnangia opened 3 months ago

ayushnangia commented 3 months ago

Request: Implement a Masked Autoencoder (MAE) based version of ViT-Adapter for segmentation tasks.
Rationale: Leverage MAE's self-supervised learning capabilities to potentially improve segmentation performance.
Query: Has there been any consideration or work on integrating MAE with ViT-Adapter for segmentation?