Labmem-Zhouyx / CDFSE_FastSpeech2

The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis”
MIT License
81 stars 12 forks source link

请问需要多少说话人训练泛化性很好? #6

Open Pydataman opened 1 year ago

Pydataman commented 1 year ago

1 提出的问题,我也看了下,请问一般需要多少个说话人训练此模型可以避免这个问题?谢谢