Executedone / Chinese-FastSpeech2

基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏
243 stars 40 forks source link

About fine-tuning issues. #11

Open ltydd opened 1 year ago

ltydd commented 1 year ago

I plan to fine-tune my own dataset based on the AISHELL3 model, but my dataset only has 6 speakers, while AISHELL3 has 218. When loading the model, an error occurred due to the size mismatch. Additionally, Baker dataset only has one speaker, which also doesn't match with AISHELL3. I wonder how the author dealt with this issue?

Executedone commented 1 year ago

the repo is based on the Baker dataset, so speaker is limited to one, if u want more speakers to train, you can move to https://github.com/ming024/FastSpeech2