tencent-ailab / V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
2.26k stars 281 forks source link

about the gt_generate_compare.mp4 #21

Closed NinoNeumann closed 5 months ago

NinoNeumann commented 5 months ago

Is this gt_generate_compare.mp4 the result of overfitting?

tiankuan93 commented 5 months ago

The training process is about learning a mapping relationship so that you can get the desired result when inference. If the model fits the training data well and also gets good results in the test data, this is exactly what we want the model to do.

Furthermore, if you look carefully at the area around the face in the video, you will see that there is still a lot of "generated" content in the generated video.