Closed piyushK52 closed 3 months ago
python inference.py \
--reference_image_path "./test_samples/emo/talk_emotion/ref.jpg" \
--audio_path "./test_samples/emo/talk_emotion/aud.mp3" \
--kps_path "./test_samples/emo/talk_emotion/kps.pth" \
--output_path "./output/test/talk_emotion_aud_result_0.95_2.0.mp4" \
--retarget_strategy "no_retarget" \ # no need to retarget
--reference_attention_weight 0.95 \
--audio_attention_weight 2.0 \
--num_inference_steps 25
https://github.com/tencent-ailab/V-Express/assets/19601425/be7d869f-08b1-46b8-b389-b5235ac7221e
talk_emotion's
reference image and talk_hb's
target video, and the audio still uses talk_emotion
. Then you need to execute it with the following scriptpython inference.py \
--reference_image_path "./test_samples/emo/talk_emotion/ref.jpg" \
--audio_path "./test_samples/emo/talk_emotion/aud.mp3" \
--kps_path "./test_samples/emo/talk_hb/kps.pth" \ # change this
--output_path "./output/test/talk_emotion_aud_hb_result_0.95_2.0.mp4" \
--retarget_strategy "naive_retarget" \ # need retarget
--reference_attention_weight 0.95 \
--audio_attention_weight 2.0 \
--num_inference_steps 25
https://github.com/tencent-ailab/V-Express/assets/19601425/6377cd28-acc0-4eff-95ec-edc761702680
I am trying to run the samples given in the code, but the outputs are not like the ones shown in the examples. Could you let me know if this is due to an issue with the installation on my part, or does it require manual adjustment of the settings?
https://github.com/tencent-ailab/V-Express/assets/34690994/63652d18-e88b-4176-98a8-6f5dc10f59f4
https://github.com/tencent-ailab/V-Express/assets/34690994/89e65b26-1b5e-4523-ac3e-8c2dfbf5c6da