facebookresearch / mae

PyTorch implementation of MAE https://arxiv.org/abs/2111.06377

[Question] about CNN backbone #46

Closed CaptainEven closed 1 year ago

CaptainEven commented 2 years ago

Is it possible to replace the ViT backbone with a regular CNN backbone such as ResNet?

gaopengpjlab commented 2 years ago

You may be interested in the following hybrid convolution-transformer MAE architecture:

https://github.com/Alpha-VL/ConvMAE

keyu-tian commented 1 year ago

> Is it possible to replace a ViT backbone with a regular CNN backbone like ResNet?

Perhaps this is what you're looking for, MAE-style pretraining on standard ResNets or ConvNeXts: "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"
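
For context on why the swap is not trivial: MAE's ViT encoder simply drops masked patches from its token sequence, whereas a convolutional network operates on a dense pixel grid and needs the masked regions represented somehow. The sketch below is not taken from this repository or from either linked project. It only illustrates the naive option of zeroing out masked patches before feeding a CNN, using plain Python lists so that it runs without PyTorch. The function names `random_patch_mask` and `apply_mask` are hypothetical, and the 75% ratio matches the default reported in the MAE paper.

```python
import random


def random_patch_mask(grid_h, grid_w, mask_ratio, seed=None):
    """Return a grid_h x grid_w grid of booleans; True marks a masked patch.

    Hypothetical helper: picks int(grid_h * grid_w * mask_ratio) patches
    uniformly at random, similar in spirit to MAE's random masking.
    """
    rng = random.Random(seed)
    num_patches = grid_h * grid_w
    num_masked = int(num_patches * mask_ratio)
    masked = set(rng.sample(range(num_patches), num_masked))
    return [
        [(r * grid_w + c) in masked for c in range(grid_w)]
        for r in range(grid_h)
    ]


def apply_mask(image, mask, patch_size):
    """Zero out every pixel that falls inside a masked patch.

    `image` is a single-channel H x W nested list. A dense network still
    sees the full grid, only with the masked regions blanked.
    """
    return [
        [
            0.0 if mask[y // patch_size][x // patch_size] else pixel
            for x, pixel in enumerate(row)
        ]
        for y, row in enumerate(image)
    ]


# Toy example: an 8x8 image split into 2x2 patches gives a 4x4 patch grid.
image = [[1.0] * 8 for _ in range(8)]
mask = random_patch_mask(4, 4, mask_ratio=0.75, seed=0)
masked_image = apply_mask(image, mask, patch_size=2)
```

With a real model the same mask would be applied to an image tensor, and the reconstruction loss would typically be computed only on the masked patches, as in MAE.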

CaptainEven commented 1 year ago

Thanks for the advice!