Hangz-nju-cuhk / Talking-Face_PC-AVS

Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)
Creative Commons Attribution 4.0 International
916 stars 169 forks source link

unconsistent duration time between mp3 and mp4 in LRW #41

Open renrenzsbbb opened 3 years ago

renrenzsbbb commented 3 years ago

Thanks for your great work. I use your prepared video to test your model, there is no problem. Howerer, when I test the video in LRW, it does not work. And then, I find that the original mp4 duration time is 1.16s, but it change to 1.21s after converting to mp3 by ffmpeg, can you give me some advice? Thanks in advance.

Hangz-nju-cuhk commented 3 years ago

You can try neglecting the last 0.05s of the mp3 file. Such small durations normally cannot be observed by humans. The mismatched durations are usually filled with silence.