mlfoundations / model-soups

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
MIT License
426 stars 38 forks source link

checkpoint for ViT-g/14 #9

Closed sorobedio closed 1 year ago

sorobedio commented 1 year ago

Where to get the checkpoints of vit-g/14 models? In the code i only saw code for VIT-B/32 base. could you show me how to load the ViT-G/14?

mitchellnw commented 1 year ago

The ViT-G/14 models are from https://arxiv.org/abs/2106.04560 and not available publicly.

There are open source efforts trying to train public ViT-G/14 zero-shot models, https://laion.ai/blog/giant-openclip/, but these models came after the soups paper so we did use them.

sorobedio commented 1 year ago

ok thank you