some confuse for "Pre-trained CLIP-Text encoders for multiple languages"

FreddeFrallan / Multilingual-CLIP

OpenAI CLIP text encoders for multiple languages!

MIT License

746 stars 69 forks source link

some confuse for "Pre-trained CLIP-Text encoders for multiple languages" #6

Open moluchase opened 3 years ago

moluchase commented 3 years ago

if i have <other language text , image, label> pair data, can i directly use 'distilbert-base-multilingual-cased' to pre-train clip model? Why re-train a model for englist to other languages ?

ZiboZ commented 2 years ago

I guess because the image-text matching task is done on English corpus only.