--asr_model Hubert? - Githubissues

ZiqiaoPeng / SyncTalk

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

https://ziqiaopeng.github.io/synctalk/

Other

1.35k stars 161 forks source link

Closed kike-0304 closed 8 months ago

kike-0304 commented 8 months ago

我在用hubert来重新训练May数据，得到的人物嘴巴抖动很快，使用Hubert来训练需要更改其他的设置吗？
为什么hubert的self.audio_in_dim = 27，在er-nerf中是1024？ if 'esperanto' in self.opt.asr_model: self.audio_in_dim = 44 elif 'deepspeech' in self.opt.asr_model: self.audio_in_dim = 29 elif 'hubert' in self.opt.asr_model: self.audio_in_dim = 27 else: self.audio_in_dim = 32

ZiqiaoPeng commented 8 months ago