TheoCoombes / ClipCap

Using pretrained encoder and language models to generate captions from multimedia inputs.

Evaluation using pre-trained model #8

Open uu95 opened 1 year ago

uu95 commented 1 year ago

Hello,

I love your work, really impressive stuff! I'm working on something similar and was wondering if you might have a pretrained model I could play around with for some basic tests.

Thanks!

LemonZhong commented 1 year ago

Same request!