Closed yangleituitui closed 2 months ago
Hi Yanglei, what is the audio you utilized for inference? If the audio differs a lot with training audios, such issue can happen due to the model generalization ability.
Thank you very much for your answer. I have already solved the problem; it was an error with the downloaded voice model. I would also like to ask, where is the facial frame rate originally at 60 FPS converted to 15 FPS
Hi Yanglei, you can downsample the facial sequences to 15 FPS. This is the original setting in the BEAT paper.
Hello, I'm working with the BEAT dataset to infer facial expressions and gestures from speech, and after visualizing with Blender, the mouth opens too wide and cannot close. What could be the reason for this issue?