Open xhran2010 opened 6 years ago
How can I train the model by myself?
There are lots of video description datasets on the websites. Such as https://www.microsoft.com/en-us/research/publication/msr-vtt-large-video-description-dataset-bridging-video-language-supplementary-material/.
How can I train the model by myself?