openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
MIT License
26.23k stars 3.35k forks source link

Fine tuning #398

Open Rohinivv96 opened 1 year ago

Rohinivv96 commented 1 year ago

HI, I would like to fine tune CLIP model for custom dataset so that it will detect the object in an image given a image as a a prompt. Can anyone suggest me any steps to be followed for it or any references for the same ?

ijazul-haq commented 1 year ago

@Rohinivv96 same question

Rickylht commented 1 year ago

use open_clip

BaochaoZhu commented 8 months ago

HI, I would like to fine tune CLIP model for custom dataset so that it will detect the object in an image given a image as a a prompt. Can anyone suggest me any steps to be followed for it or any references for the same ?

same question