-
Hi! I want to test video retrieval with `all-in-one-base.ckpt` on MSR-VTT and see the metrics to compare with the paper. Can you please help with the command?
I tried the following command but it sta…
-
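Hello, I loaded the model weights you fine-tuned on the DiDeMo dataset (ret_didemo_l16_25m.pth) to reproduce the results on the DiDeMo test set, and obtained the retrieval results shown in the image below:
![image](https://github.com/OpenGVLab/unmasked_teacher/assets/55133120/e80c41b1-0aae-48d5-92ed-a77cdf66827f)
These are inconsistent with the results in your paper,…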
-
Thank you for sharing your code.
Could you please provide additional details regarding the inference speed calculation in Fig. 2 and Table 3? I am a bit confused.
Regarding Table 3, where the i…
-
Hello author, your paper mentions the use of flipping, but the data augmentation in the code uses only CenterCrop, without flipping. I have some doubts. Can you publicly …
Li524 updated
11 months ago
-
(ms_sl) root@acd91bbbb0c2:/home/ms-sl-main# ./do_test.sh $DATASET $FEATURE $ROOTPATH $MODELDIR
2023-06-22 08:24:36.436:INFO:__main__ - Setup config, data and model...
------------ Options ----------…
-
When I run the training code, I use the TextVideo sample with MSRVTT data and run the following config:
```
CACHE_DIR = '/root/.cache'
TRAIN_DATA = '/content/MSRVTT_data.json'
# this script is fo…
-
Hello, thank you very much for your work and code. I have a few questions that puzzle me, and I hope you can help.
I trained the small diet many times according to the parameters you gave…
-
When I try to reproduce the demo, I run into problems with Java. I tried jre-17 and jre-19, but the Java version does not seem to be the cause.
Is there any good solution for this please?
`…
-
I trained CLIP4Clip on the DiDeMo dataset, and the text-to-video R@1 is much worse than the value reported in the paper.
The metric reported in the paper is 43.4 on DiDeMo when the similarity calculator is `meanP` and…
-
Hello, where can I get the missing metadata files such as txt_msvd-retrieval.json, args_msrvtt-qa.json, and args_msvd-retrieval.json? Should I create them myself?
In addition, how can I evaluate QA? I…