Fictionarry / TalkingGaussian

[ECCV'24] TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting
https://fictionarry.github.io/TalkingGaussian/
213 stars 28 forks source link

推理时输入全是0的音频,嘴依然是微张的,这个有没有办法解决呢? #21

Open anliyuan opened 2 months ago

anliyuan commented 2 months ago

推理时输入全是0的音频,嘴依然是微张的,这个有没有办法解决呢? 20240722170732

Fictionarry commented 2 months ago

属于是模型没学到静音时应该闭嘴,可以在素材里多添加点静音且闭嘴的片段,或者找一个对应于闭嘴动作的audio feature作为静音时输入的feature