Closed (einsqing closed this 1 month ago)
Thank you for using our model :)
As in previous talking-head work such as ER-NeRF, the first 10/11 of the data is used for training, and the audio of the remaining 1/11 is used for inference.
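To make the split concrete, here is a minimal sketch of a chronological 10/11 vs. 1/11 split over frame indices. It is only an illustration, not the repository's actual code: the function name `split_frames` and the frame count of 1100 are hypothetical, and it assumes the audio features are aligned one-to-one with video frames.

```python
def split_frames(num_frames: int) -> tuple[range, range]:
    """Split frame indices chronologically: first 10/11 train, last 1/11 test.

    Hypothetical helper for illustration; integer arithmetic avoids
    floating-point rounding at the boundary.
    """
    split = num_frames * 10 // 11
    return range(0, split), range(split, num_frames)


# Example with an assumed clip of 1100 frames.
train_idx, test_idx = split_frames(1100)
print(len(train_idx), len(test_idx))  # 1000 100
```

Under this assumption, the audio used at inference is simply the segment that lines up with the held-out `test_idx` frames.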
How do I specify the audio for inference?