Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
一共两个问题。 1.在单音色原有的基础上进行调整训练形成另一个音色。 2.如何讲上面二次训练的音色配置到模型中,并且推理阶段可以通过索引来控制使用哪种音色合成