video-generation-evaluation Search Results

yochaiye/LipVoicer #5

The result of LSE-C GT (Ground Truth) on the LRS2 dataset I …

Excellent work！ Amazing LipVoicer! I have a small question about the evaluation metric of sync: LSE-C and LSE-D. In [LIPVOICER: GENERATING SPEECH FROM SILENT VIDEOS GUIDED BY LIP READING](https…

MyBeautiful-Fantasy updated 2 days ago

Vchitect/VBench #61

Cogvideo2B score

Hi, I see that the total score of cogvideo2B on Leaderboard is 80.94%, but after I use all_dimension_long. txt to inference, the total score measured is only 78.68%. The video I produced with cogvide…

CacacaLalala updated 3 weeks ago

McGill-NLP/mcgill-nlp.github.io #324

update publication

### Action Update publication ### Title Evaluating In-Context Learning of Libraries for Code Generation ### Shorthand icl-libraries ### Author Arkil Patel ### Names Arkil Patel, Siva Reddy, D…

arkilpatel updated 4 months ago

IVG-SZ/Flash-VStream #2

Unable to reproduce the results reported in your paper

Hi Authors, Thanks for your great work first! It's an amazing contribution to the video understanding task! However, when I try to reproduce the results reported in the paper, I get several trou…

ShaneeyS updated 2 months ago

THUDM/CogVideo #194

Work plan and enhancement / 工作计划和用户诉求

Tasks that have been identified and scheduled: + Fine-tuning support for Diffusers version models + Adaptation for CPU / NPU inference frameworks (e.g., Huawei, Intel devices) + ComfyUI adaptat…

zRzRzRzRzRzRzR updated 5 hours ago

showlab/Show-1 #15

Do you have a plan to release the evaluation code of SHOW-1 …

Hi, nice work! Do you have a plan to release the evaluation code of SHOW-1 in UCF-101 and MSRVTT? If you can open source the evaluation code, I believe that future work can be fairly compared to sh…

Lauren-wh updated 7 months ago

BenchCouncil/AIGCBench #2

how to use the "Flow-Square-Mean" in Motion effects

I can't find the function or any other files to use Flow-Square-Mean ?

mamianshusheng updated 3 months ago

johndpope/VASA-1-hack #3

Roadmap

Expanding the provided code to fully recreate the VASA-1 system as described in the research paper would require a significant amount of additional code and architectural changes. Here's a high-level …

johndpope updated 4 months ago

neulab/prompt2model #316

Support for non-text modalities (images, speech, video)

Currently prompt2model is limited to text input text output tasks. The underlying framework can certainly handle different modalities, and it would be great to see prompt2model be able to handle diffe…

neubig updated 6 months ago

katesanders9/multimodal-proofs #1

Dialogue retrieval index construction

# Overview TBD # Progress - [X] Establish repo with some likely useful [NELLIE](https://github.com/nweir127/guided_inference) code - [x] Implement TVQA HF dataset on BRTX - [x] Set up TVQA even…

katesanders9 updated 1 year ago

861 results for video-generation-evaluation

861 results
for video-generation-evaluation