tencent-ailab / V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
2.2k stars 277 forks source link

Getting different results #8

Closed piyushK52 closed 3 months ago

piyushK52 commented 4 months ago

I am trying to run the samples given in the code, but the outputs are not like the ones shown in the examples. Could you let me know if this is due to an issue with the installation on my part, or does it require manual adjustment of the settings?

https://github.com/tencent-ailab/V-Express/assets/34690994/63652d18-e88b-4176-98a8-6f5dc10f59f4

https://github.com/tencent-ailab/V-Express/assets/34690994/89e65b26-1b5e-4523-ac3e-8c2dfbf5c6da

tiankuan93 commented 4 months ago
python inference.py \
    --reference_image_path "./test_samples/emo/talk_emotion/ref.jpg" \
    --audio_path "./test_samples/emo/talk_emotion/aud.mp3" \
    --kps_path "./test_samples/emo/talk_emotion/kps.pth" \
    --output_path "./output/test/talk_emotion_aud_result_0.95_2.0.mp4" \
    --retarget_strategy "no_retarget" \         # no need to retarget
    --reference_attention_weight 0.95 \
    --audio_attention_weight 2.0 \
    --num_inference_steps 25

https://github.com/tencent-ailab/V-Express/assets/19601425/be7d869f-08b1-46b8-b389-b5235ac7221e

python inference.py \
    --reference_image_path "./test_samples/emo/talk_emotion/ref.jpg" \
    --audio_path "./test_samples/emo/talk_emotion/aud.mp3" \
    --kps_path "./test_samples/emo/talk_hb/kps.pth" \     # change this
    --output_path "./output/test/talk_emotion_aud_hb_result_0.95_2.0.mp4" \
    --retarget_strategy "naive_retarget" \      # need retarget
    --reference_attention_weight 0.95 \
    --audio_attention_weight 2.0 \
    --num_inference_steps 25

https://github.com/tencent-ailab/V-Express/assets/19601425/6377cd28-acc0-4eff-95ec-edc761702680