Yushi-Hu / PromptCap

Natural language guided image captioning

Could you provide more details on how to reproduce PromptCap? #12

Open Linyuxing opened 4 months ago

Linyuxing commented 4 months ago

I have downloaded your training data, vqa2_train_1010.zip. How can I use this dataset with OFAsys to reproduce a model like PromptCap? For example, how do I load this type of dataset in OFAsys? Could you please provide the YAML config file you used to fine-tune your PromptCap model, like the one in this screenshot? (Screenshot: POPO-screenshot-20240603-134207)