Open naajeehxe opened 2 months ago
From my understanding, you used 1k samples for the test set. To accurately reproduce the results from the paper, could you please provide the sample IDs used for the test set?
Yes, me too. While trying to reproduce the results. I couldnt find the files mentioned by @naajeehxe plus the following file: 'datasets/annotations_all/msvd_caption/train.jsonl' It would be great if you could let us know how to generate the same.
@idj3tboy I’m not sure if this will be helpful, but I’d like to share how I did it. I downloaded the data from (https://cove.thecvf.com/datasets/839) and used the following two txt files for the train/test split:
• MSRVTT/videos/train_list_new.txt
• MSRVTT/videos/test_list_new.txt
As a result, I got 7010 train data and 2990 test data. I’m not exactly sure what the 9k/1k train/test data used in the paper refers to, but I was able to reproduce results similar to the paper using this 7k/3k train/test split.
If you’re in a hurry, it might be a good idea to give it a try!
Thank you for your wonderful project!
Could you provide the train/test split JSON files for the MSR-VTT caption dataset? I am unable to access the following files: