Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Hi,
I used logits from another model (Wav2Letter) to test the CTC decoder, but I can't get correct results from it. For example, the CTC decoder returned "sometimemesttheeultbotthfehinatetwhorkhasreallygod", while the greedy decoder gave me "sometimd es the ou but of fan at work is really god". Why does the CTC decoder concatenate all the characters instead of keeping the spaces between words?
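To make the comparison concrete, greedy CTC decoding can be sketched roughly as below. This is a minimal illustration, not the actual Wav2Letter or PaddleSpeech code: the function name `ctc_greedy_decode`, the 4-symbol vocabulary, the `blank_id` of 0, and the logits are all made up for the example. The point is that the space is treated as an ordinary vocabulary entry, so word boundaries survive decoding.

```python
from itertools import groupby


def ctc_greedy_decode(logits, vocab, blank_id=0):
    """Greedy CTC decoding: argmax per frame, collapse repeats, drop blanks."""
    # Pick the highest-scoring token index at every time step.
    best_path = [max(range(len(frame)), key=frame.__getitem__) for frame in logits]
    # Collapse consecutive repeats first, then remove the blank symbol.
    collapsed = [idx for idx, _ in groupby(best_path)]
    return "".join(vocab[idx] for idx in collapsed if idx != blank_id)


# Hypothetical vocabulary (assumption): blank, space, "a", "b".
vocab = ["_", " ", "a", "b"]
logits = [
    [0.1, 0.0, 0.9, 0.0],  # "a"
    [0.1, 0.0, 0.9, 0.0],  # "a" again (repeat, collapsed)
    [0.9, 0.0, 0.1, 0.0],  # blank
    [0.0, 0.9, 0.1, 0.0],  # space
    [0.0, 0.1, 0.0, 0.9],  # "b"
]
decoded = ctc_greedy_decode(logits, vocab)
print(repr(decoded))
```

If the other decoder is given a vocabulary in which the space symbol is missing or spelled differently, I would expect output without word boundaries, but that is only a guess on my side.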
Thanks,