Closed by YangBowenn 9 months ago
Thank you for your attention.

    row_ind, col_ind = linear_sum_assignment(2.0 - sim_matrix_best)
    my_post = compute_trick_metrics(row_ind, col_ind)

The above code is our proposed Text-Video Hungarian post-processing strategy. For detailed descriptions, please refer to Section 3 and Table 3 in our supplementary material.
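For readers following along, here is a minimal sketch of what a one-to-one (Hungarian) assignment does to R@1 compared with ordinary per-query ranking. It is not the repository's `compute_trick_metrics`: the helper names `recall_at_1_standard` and `recall_at_1_hungarian` and the toy similarity matrix are hypothetical, and it assumes rows are texts, columns are videos, and the ground-truth pairs lie on the diagonal.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment


def recall_at_1_standard(sim):
    """R@1 when each text independently picks its highest-scoring video."""
    pred = sim.argmax(axis=1)
    return float(np.mean(pred == np.arange(sim.shape[0])))


def recall_at_1_hungarian(sim):
    """R@1 after forcing a one-to-one text-video matching (hypothetical helper)."""
    # linear_sum_assignment minimizes total cost, so similarities are turned
    # into costs by subtracting them from a constant (2.0, as in the snippet above).
    row_ind, col_ind = linear_sum_assignment(2.0 - sim)
    return float(np.mean(col_ind == row_ind))


# Toy 3x3 similarity matrix (made-up values): texts 0 and 1 both score
# highest on video 0, so independent ranking gets text 1 wrong.
sim = np.array([
    [0.90, 0.80, 0.10],
    [0.85, 0.80, 0.20],
    [0.10, 0.20, 0.70],
])

standard = recall_at_1_standard(sim)
hungarian = recall_at_1_hungarian(sim)
print(f"standard R@1:  {standard:.3f}")
print(f"hungarian R@1: {hungarian:.3f}")
```

In this toy case the assignment step resolves the conflict between texts 0 and 1 because each video can be used only once. That constraint relies on the test set being strictly paired, which per-query ranking does not assume, so the two procedures are not expected to give the same R@1.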
Thank you for your patient response. The supplementary material states that, without the post-processing strategy, R@1 reaches 48.2 on the MSRVTT-9k dataset. However, when I tried to replicate the results with the parameters given in the paper, I only got around 40, which is significantly lower than the reported number. My MSRVTT data loading follows TS2-Net. I am unsure where the issue may be.
Hello, I think you can try the following solutions:
Thank you for your answer. The previous issue was caused by not setting "--expand_msrvtt_sentences". With it set, the highest R@1 I get on the MSRVTT dataset is around 46.6, but there is still a significant gap compared to the 48.2 reported in the paper. It doesn't seem to be explained by random variation alone.
Sorry, it seems this issue does exist; see, for example, https://github.com/yuqi657/ts2_net/issues/3. Besides the impact of the experimental environment and GPU, some settings also need to be changed. For our experiments on the MSR-VTT dataset, we adjusted the mask ratio and the random seed: the random seed is 42 and the mask ratio in the frame decoder is 0.5. You can also try other values to see if they work.
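As a rough illustration of pinning the random seed mentioned above, here is a minimal sketch. The `set_seed` helper is hypothetical and covers only Python's `random` module and NumPy; it does not show how this repository's training script sets its seed. A PyTorch run would also need its own seeding, for example `torch.manual_seed`.

```python
import random

import numpy as np


def set_seed(seed):
    """Seed Python's and NumPy's global random number generators (hypothetical helper)."""
    random.seed(seed)
    np.random.seed(seed)
    # A PyTorch training run would additionally call torch.manual_seed(seed).


# Seeding twice with the same value reproduces the same random draws.
set_seed(42)
first = (random.random(), float(np.random.rand()))
set_seed(42)
second = (random.random(), float(np.random.rand()))
print(first == second)
```

Even with a fixed seed, results can still differ across GPUs and library versions, which matches the environment caveat above.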
Thank you very much for sharing your work. I have a question and would like to seek clarification. Why do the results differ when computed using 'compute_trick_metrics' and 'compute_metrics'? The former yields a result of 48.3, whereas the latter, under the same parameters, can only achieve 40.3. It's worth noting that 'compute_metrics' is the calculation method employed by most methods compared in the paper.