Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
请问deepspeech2提取音频特征在代码中是encoder出来的就是音频特征了么?decoder是吧提取到的音频特征和解码成文字么?