lucidrains / CoCa-pytorch

Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch
MIT License

Any plan to share the pre-trained models? #2

Open sbkim052 opened 2 years ago

sbkim052 commented 2 years ago

Hi, thanks for sharing this wonderful code. Do you have any plans to share the pre-trained models?

shizhediao commented 2 years ago

same question

dragen1860 commented 2 years ago

+1

dwlmt commented 2 years ago

Same question. I'm looking for a model that can perform retrieval as in CLIP or ALIGN and also caption images, so it would be good if the pretrained weights were made available.

rom1504 commented 2 years ago

If you want them, make them. No such pretrained models exist for this repo.

gabrielsantosrv commented 2 years ago

> Same question. I'm looking for a model that can perform retrieval as in CLIP or ALIGN and also caption images, so it would be good if the pretrained weights were made available.

@dwlmt I am looking for the same model :/

yxding95 commented 2 years ago

+1. Hope the authors release the pre-trained model. Thanks!

hpeiyan commented 2 years ago

+10086

willyfh commented 2 years ago

+1

beybars1 commented 2 years ago

It would be great if the authors shared the pre-trained models.

mexicantexan commented 2 years ago

+9999

Tangnameless commented 2 years ago

+1

Ella2le commented 2 years ago

+1

AnonymousDestroyer commented 2 years ago

+99

adeetyapatel12 commented 2 years ago

+5555

xs1997zju commented 1 year ago

+1

rom1504 commented 1 year ago

I think there is some confusion here :)

This repo is a reproduction repo. The author of the repo does not have any pretrained models.

If you want to reproduce CoCa models, you can use this repo (and a lot of compute power) to train some.

TjuJianyu commented 1 year ago

+999

yoon28 commented 1 year ago

+1

Timotheeee commented 1 year ago

It's made by Google, so obviously they won't be sharing the models.

germanjke commented 1 year ago

I guess we can take ViT weights from the ViT-Pytorch library, but the NLP part of the model contains many Transformer blocks that we don't have weights for. It looks like we can't get full weights for CoCa right now.
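
To make the gap concrete, here is a minimal sketch in plain Python. The parameter names and checkpoint keys below are made up for illustration and are not the actual state-dict keys of this repo or of any ViT library; the point is only that a ViT checkpoint can cover the image-encoder parameters and nothing else.

```python
# Hypothetical parameter names, for illustration only; the real CoCa
# module's state-dict keys may look different.
coca_param_names = [
    "img_encoder.patch_embedding.weight",
    "img_encoder.transformer.layers.0.attn.weight",
    "unimodal_layers.0.attn.weight",
    "multimodal_layers.0.cross_attn.weight",
    "token_emb.weight",
]

# Keys that a pretrained ViT checkpoint might provide (also hypothetical).
vit_checkpoint_keys = {
    "patch_embedding.weight",
    "transformer.layers.0.attn.weight",
}


def split_by_coverage(param_names, checkpoint_keys, prefix="img_encoder."):
    """Split parameter names into those the ViT checkpoint covers and the rest."""
    covered, missing = [], []
    for name in param_names:
        # Only parameters under the image-encoder prefix can match ViT keys.
        if name.startswith(prefix) and name[len(prefix):] in checkpoint_keys:
            covered.append(name)
        else:
            missing.append(name)
    return covered, missing


covered, missing = split_by_coverage(coca_param_names, vit_checkpoint_keys)
print("initialised from ViT:", covered)
print("still randomly initialised:", missing)
```

Everything in the second list (the text layers and the token embedding in this toy example) would still have to be trained from scratch.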

rom1504 commented 1 year ago

If someone feels like plugging https://github.com/lucidrains/CoCa-pytorch/blob/main/coca_pytorch/coca_pytorch.py into the https://github.com/mlfoundations/open_clip codebase, I could quickly train it.

byougert commented 1 year ago

+1

zhangj1an commented 1 year ago

+1

rom1504 commented 1 year ago

Done by @gpucce: https://twitter.com/gpuccetti92/status/1621874887844315139?t=l-j6Iv_5d9A48d0XPzvUQg&s=19