Open Zrduan724 opened 9 months ago
Hi, thanks for your great work.
I would like to know how to prepare the "mesh sequence" input data when performing inference with unseen audio data?
More specifically, if I have one wild audio clip without mesh sequence, would the learned facial animation style only be consistent with the subject styles in the training set? Because I can only use the mesh sequence in training set.
Hi, thanks for your great work. I would like to know how to prepare the "mesh sequence" input data when performing inference with unseen audio data?
More specifically, if I have one wild audio clip without mesh sequence, would the learned facial animation style only be consistent with the subject styles in the training set? Because I can only use the mesh sequence in training set.
yes, usually during inference, we leverage mesh sequence in training set as style reference
Hi, thanks for your great work.
I would like to know how to prepare the "mesh sequence" input data when performing inference with unseen audio data?