guilk / VLC

Research code for "Training Vision-Language Transformers from Captions Alone"
33 stars 4 forks source link

Thank you for your code! If the pre-trained checkpoint of bert embeding is avaiable? #5

Closed senmaoy closed 2 years ago

guilk commented 2 years ago

Hi, we train our models from scratch. You can find the trained word embeddings in our provided weights vlc_largeset.ckpt.