-
First of all, thanks for your contribution. Recently I am working for some recurrence experiments of this paper. However I encounter some problems that I am not quite sure. In the google driver these'…
-
hi, could you tell the way to split the msr-vtt dataset? Many thanks!!
-
run a set of benchmarks on an aggregation model:
MSR-VTT, MSVD etc.
-
Thanks for your extraordinary work of video-text retrieval with T2VLAD.
Here, I have a little request about this work: could you share the other dataloaders, configs of MSR-VTT at 1k-A split, MSVD an…
-
hey, Really thank you for your work firstly!
I have checked the dataset' json files that you provided. But I am quite confused. Could you tell me how to download them? Can you provide some scripts? …
-
Thanks for your work!
Could you upload the model's pretrained checkpoint file?
I want to test with the weights file to caption video input.
Thank you
-
Msr vtt dataset have 10000 videos and 20 captions for each video but in this implementation only a video-caption pair in train phase is considered. Therefore in total
-
Hi,
I want to train the model using the MSR-VTT dataset. And it tells me that I need a pkl file but I can only find the mp4 and txt files. So how can I tranfer them to or maybe to find the pkl file.
-
I read in the readme file, paligemma can captioning a short video, anyone can guide me to do that?
Does it extract every frames on the video? Or does the paligemma tokenizer directly support video…
-
In the paper, "So we assess the models previously trained on MSR-VTT using the MSVD test set" refers to training with the entire data set of MSRVTT, and testing the model with the test set of MSVD(670…