-
Dear authors
can you share your extracted features for msvd and msr-vtt, and their extracted settings?
Btw, I also want to extract these features for videos by myself. Therefore, it would be be…
-
When I run `make -f Makefile_msrvtt_svo train GID=0 EXP_NAME=xe FEATS="irv2 c3d category" BFEATS="roi_feat roi_box" USE_RL=0 CST=0 USE_MIXER=0 SCB_CAPTIONS=0 LOGLEVEL=DEBUG MAX_EPOCH=100 LAMBDA=20`
…
-
Hi, just wondering when the zip for the MSRVTT data will be uploaded? Thanks!
-
Hi, I saw in the paper says the text features are: pretrained word2vec word embeddings and then passed through a pretrained OpenAI-GPT model. I usually see people using word2vec or GPT alone, instead …
-
Thank you for your amazing work! Nevertheless, I still have some questions about your motivation.
As you mentioned in your paper(the abstract section): _**actions generated by existing methods may …
-
Hi, thanks for the wonderful work.
I want to caption my own videos giving the video frames (without transcript), can I use the pretrained weight (`univl.pretrained.bin`) provided in the repository di…
-
Dear author, thanks your work. When I download MSRVTT dataset, there are some urls broken. How do you solve it, thanks.
-
When I run to get the return information from predictor.resize().
"RuntimeError: requested resize to (320, 240) ((320, 240) elements in total), but the given tensor has a size of 1024 (1024 element…
-
Hi,
I am trying to reproduce the msr-vtt retrieval benchmark results on the full test set.
I download the pre-trained weighted and config from the msr-vtt benchmark sheet and only change the split_…
-
hello, I have some questions.
1.When you do 'Text-to-Video Retrieval' experiment on MSRVTT 1K test set, you do not pre-training the model end-to-end on `Howto100M` dataset. It is because the computat…