-
Thanks for the paper and the open sourcing the code base.
I would like to know how evaluation is performed on the MSR-VTT dataset for zero shot text to video retrieval.
* Are the metrics reported…
-
Hello, the following problems occurred while the code was running, but I am not sure where this/img/msrvtt is or where the address was passed. In the initial get_ video_ retrieval_ args() function als…
-
How can I train the model by myself?
-
## タイトル: T2VIndexer: 効率的なテキスト-動画検索のための生成的動画インデクサー
## リンク: https://arxiv.org/abs/2408.11432
## 概要:
現在のテキスト-ビデオ検索手法は、主にクエリとビデオ間のクロスモーダルマッチングに依存して、類似度スコアを計算し、そのスコアでソートして検索結果を得ています。この手法は、各候補ビデオとクエリのマッ…
-
大佬您好,冒昧打扰。
我看了您做的diffusionret的工作,思路非常的好, 在对评测指标里面我对里面的代码比较疑惑,
在进行msvd数据集技能型评估时,
在main_retrieval的595行sim_matrix = new_t2vmatrix这里,为什么这里不直接采用
indices = torch.argsort(sim_matrix , dim=1, descending=T…
-
Hi,
Thanks for your code. I found there is a gap comparing the recorded results (on the paper or the repo) after I exactly followed the "test" code. Here are my results:
MSVD:
RESULTS: Bleu_1: …
-
Hi, Can I get your captioning result?
-
Can you share some recordings of your experiments like some graphs in neptune.ai or other logs tracking the performance/loss changes in training steps.
I would like to compare the effects of some c…
-
Hi, I met a bug, but I haven't fixed it. I want to use this code to train model on the MSVD dataset. I can train the model on the MSR-VTT. So I prepare the caption.json and info.json based on default …
-
Thanks for you work on this project!
I followed the instructions in the readme to get your code running, and I wasn't able to reproduce the results from the paper:
MSVD:
RESULTS: Bleu_1: 0.858…