X-PLUG / mPLUG-2

mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)
Apache License 2.0
220 stars, 18 forks

Is there a way to adapt the model to cartoon video clips? #7

Open aartykov opened 1 year ago

aartykov commented 1 year ago

Hi! Is there a way to adapt the model to cartoon video clips? I do not have a cartoon video-caption dataset, so I am looking for a way to fine-tune the model on a cartoon image-caption dataset instead. Is that possible?