Open mutonix opened 5 months ago
Could you please open a PR adding it to the dataset section?
Vript is a fine-grained video-text dataset with 12K annotated high-resolution videos (~400k clips), where each clip has a detailed caption of ~145 words.
Isn't that a non-commercial dataset?
Yes, it is licensed for non-commercial, academic use only.