Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
目前测试下来mb_melgan_csmsc的推理速度满足离线环境下CPU实时推理,其他的VOC模型都太慢,但是我用fastspeech2_mix微调克隆了一个男生音色后,使用mb_melgan_csmsc推理音色就不对,只能使用aishell3数据集训练的VOC模型,而mb_melgan只有csmsc数据集训练的模型,问题来了,如何使用aishell3数据训练一个mb_melgan模型,求指点