Haochen-Wang409 / U2PL

[CVPR'22 & IJCV'24] Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels & Using Unreliable Pseudo-Labels for Label-Efficient Semantic Segmentation
Apache License 2.0

Using pre-trained weights from vision transformer #89

Closed elle-miller closed 2 years ago

elle-miller commented 2 years ago

Hi there, thanks for the great tool!

Would it be possible to use pre-trained weights from a vision transformer instead of ResNet? I had a quick go but ran into a memory error.

e.g. using https://pytorch.org/vision/main/models/generated/torchvision.models.vit_b_16.html#torchvision.models.vit_b_16 with the pre-trained checkpoint https://download.openmmlab.com/mmsegmentation/v0.5/pretrain/segmenter/vit_small_p16_384_20220308-410f6037.pth

I am curious whether you think this would be possible, or if you have any thoughts, before I continue.

Thank you very much,

Elle

Haochen-Wang409 commented 2 years ago

Hi, thanks for your interest and kind words!

Since you have encountered an out-of-memory error using ViT-B/16, you could try reducing batch_size in config.yaml.
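For context, a rough sketch of why a ViT backbone can exhaust GPU memory even at a small batch size: the self-attention maps stored in the forward pass grow quadratically with the number of patch tokens, and segmentation crops are much larger than the 224×224 inputs ViT-B/16 was designed for. The crop size below (768×768) is a hypothetical example, not a value from this repo's configs; standard ViT-B/16 hyperparameters (16×16 patches, 12 heads, 12 layers, fp32) are assumed.

```python
# Rough estimate of forward-pass memory for ViT-B/16 attention maps
# at batch size 1, fp32. Assumed hyperparameters: 16x16 patches,
# 12 heads, 12 layers; crop sizes are illustrative only.
PATCH, HEADS, LAYERS, BYTES_FP32 = 16, 12, 12, 4

def attn_map_bytes(img_size, patch=PATCH, heads=HEADS, layers=LAYERS):
    tokens = (img_size // patch) ** 2 + 1          # patch tokens + [CLS]
    return heads * layers * tokens ** 2 * BYTES_FP32

print(f"224x224 input: {attn_map_bytes(224) / 2**30:.2f} GiB")  # classification-sized
print(f"768x768 input: {attn_map_bytes(768) / 2**30:.2f} GiB")  # segmentation-sized
```

At a classification-sized input this is negligible, but at a 768×768 crop the attention maps alone approach 3 GiB before counting activations, gradients, and optimizer state, which is consistent with an OOM even at batch_size=1.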

elle-miller commented 2 years ago

Thanks for the fast response! My batch size is already 1. I was wondering if I needed to add any code to u2pl/models to get it working?

Haochen-Wang409 commented 2 years ago

I checked the number of parameters of ResNet-101 and ViT-B/16, which are 85M and 86M, respectively.
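The ViT-B/16 figure can be reproduced with a back-of-envelope count, no torch required. The hyperparameters below are the standard ViT-B/16 configuration (12 layers, width 768, MLP width 3072, 16×16 patches, 224×224 input, 1000-class head), assumed here rather than taken from this repo:

```python
# Back-of-envelope parameter count for a standard ViT-B/16
# (assumed config: 12 layers, dim 768, MLP 3072, patch 16,
# 224x224 input, 1000-class classification head).
D, LAYERS, MLP, PATCH, IMG, CLASSES = 768, 12, 3072, 16, 224, 1000

tokens = (IMG // PATCH) ** 2 + 1                  # 196 patches + [CLS]
patch_embed = (3 * PATCH * PATCH) * D + D         # patch projection weight + bias
cls_and_pos = D + tokens * D                      # class token + position embedding
per_layer = (
    2 * 2 * D                                     # two LayerNorms (weight + bias)
    + 3 * (D * D + D)                             # Q, K, V projections
    + (D * D + D)                                 # attention output projection
    + (D * MLP + MLP) + (MLP * D + D)             # two-layer MLP
)
head = 2 * D + (D * CLASSES + CLASSES)            # final LayerNorm + classifier
total = patch_embed + cls_and_pos + LAYERS * per_layer + head
print(f"{total / 1e6:.1f}M parameters")           # ~86.6M
```

So the two backbones are comparable in parameter count; the memory gap comes from activations at large crop sizes, not from the weights themselves.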

Does the original ResNet-101 work well?