llyx97 / FETV

[NeurIPS 2023 Datasets and Benchmarks] "FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation", Yuanxin Liu, Lei Li, Shuhuai Ren, Rundong Gao, Shicheng Li, Sishuo Chen, Xu Sun, Lu Hou

Which UMT model is used in UMTScore? #4

Closed by KaiyueSun98 5 months ago

KaiyueSun98 commented 5 months ago

Hi, thanks for your great work. I would like to confirm which UMT model (fine-tuning stage) you used to calculate UMTScore: the one fine-tuned on video-text retrieval, or the one fine-tuned on VQA?

llyx97 commented 5 months ago

Thanks for your attention to our work! We adopt the version fine-tuned on video-text retrieval.

KaiyueSun98 commented 5 months ago

Many thanks!