Open foxbeing7 opened 6 months ago
The code is for raw videos, and you can upload the video paths in your own dataset directly
The code is for raw videos, and you can upload the video paths in your own dataset directly
thanks, if i plan to finetune MovieChat in my own dataset, how should build dataset and any advice for training ?
MovieChat is training free. For more training details, please refer to https://github.com/DAMO-NLP-SG/Video-LLaMA
Nice work! As title states , If I want to build my own dataset , how do I extract visual features and generate image captions and video captions? Thank you