Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
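When running the Text-To-Speech FastSpeech2 + Parallel WaveGAN on CSMSC example on AI Studio, downloading nltk_data is very slow. It would help if nltk_data were hosted on Baidu's own servers and a download script were provided to speed up the download.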
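A rough sketch of what such a download script could look like, assuming the data were published as a single `nltk_data.tar.gz` whose top-level folder is `nltk_data`. The URL below is a placeholder and the helper names `fetch_archive` and `unpack_nltk_data` are made up for illustration; none of them come from the repository. NLTK searches `~/nltk_data` by default, so unpacking into the home directory should be enough for it to find the data.

```python
import os
import tarfile
import urllib.request

# Placeholder URL (assumption): no such mirror file is known to exist.
MIRROR_URL = "https://example.com/nltk_data.tar.gz"


def fetch_archive(url, dest_path):
    """Download url to dest_path, skipping the download if the file exists."""
    if not os.path.exists(dest_path):
        urllib.request.urlretrieve(url, dest_path)
    return dest_path


def unpack_nltk_data(archive_path, home_dir=None):
    """Extract a tar.gz whose top-level folder is nltk_data into home_dir."""
    home_dir = home_dir or os.path.expanduser("~")
    with tarfile.open(archive_path, "r:gz") as tar:
        tar.extractall(home_dir)
    return os.path.join(home_dir, "nltk_data")


# Example usage (hypothetical, needs a real mirror URL):
#   archive = fetch_archive(MIRROR_URL, "nltk_data.tar.gz")
#   print("nltk_data unpacked to", unpack_nltk_data(archive))
```

Skipping the download when the archive is already present keeps repeated runs on AI Studio cheap.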