yiranran / Audio-driven-TalkingFace-HeadPose

Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose" (Arxiv 2020) and "Predicting Personalized Head Movement From Short Video and Speech Signal" (TMM 2022)
https://ieeexplore.ieee.org/document/9894719
721 stars 146 forks source link

Unable to reproduce on custom video #33

Open dipam7 opened 4 years ago

dipam7 commented 4 years ago

I followed the Colab tutorial and could get it running. I am now trying to do the same on a custom video and audio. When fine-tuning audio net, we do

!cd Audio/code/; python train_19news_1.py 32 0

When I run this I get

32 lack frame0.mat
32 lack frame1.mat
32 lack frame2.mat
...
...
32 lack frame298.mat
32 lack frame299.mat
not all 300 frames are reconstructed successfully

My video is called 32.mp4 and I used ffmpeg to make sure it is 25 fps. Other than that I haven't modified anything in the notebook. Where is it going wrong?

Also, if anyone was able to reproduce it on a custom video, do share the notebook

prajwaljpj commented 3 years ago

@dipam7 Were you able to reproduce it on their video?

dipam7 commented 3 years ago

@prajwaljpj No I wasn't able to