SpeechColab / GigaSpeech

Large, modern dataset for speech recognition
Apache License 2.0

Add icefall RNN-T results #113

Closed wgb14 closed 2 years ago

wgb14 commented 2 years ago

We would like to add results from the icefall RNN-T recipe. The numbers are comparable to those from Conformer CTC with n-gram and attention rescoring, but decoding is much faster.

It takes only 1 minute to decode the dev dataset.

https://github.com/k2-fsa/icefall/pull/318#issuecomment-1125625918

csukuangfj commented 2 years ago

https://github.com/k2-fsa/icefall/pull/361 has updated results for modified beam search:

10.52/10.62 -> 10.50/10.60

chenguoguo commented 2 years ago

Great! Could you adjust the numbers and rank it accordingly?

wgb14 commented 2 years ago

Will do more decoding experiments and update results here

csukuangfj commented 2 years ago

> Will do more decoding experiments and update results here

You can also try --iter xx --avg xx instead of --epoch xx --avg xx.
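For readers unfamiliar with the flags: the idea behind `--avg` is to decode with the element-wise mean of several saved checkpoints, ending at the one picked by `--epoch` or `--iter`. The sketch below illustrates that idea only; it is not icefall's implementation. The helper names `select_last` and `average_checkpoints` are hypothetical, parameters are plain lists of floats rather than tensors, and the selection rule (the `avg` most recent checkpoints up to and including the chosen one) is an assumption about the flag semantics.

```python
from typing import Dict, List


def select_last(ids: List[int], end: int, avg: int) -> List[int]:
    """Pick the `avg` most recent checkpoint ids that are <= `end`.

    `ids` can be epoch numbers (--epoch) or training-step numbers (--iter).
    Assumed semantics, not taken from the icefall source.
    """
    if avg < 1:
        raise ValueError("avg must be at least 1")
    eligible = sorted(i for i in ids if i <= end)
    return eligible[-avg:]


def average_checkpoints(
    state_dicts: List[Dict[str, List[float]]]
) -> Dict[str, List[float]]:
    """Element-wise mean of every parameter across the given checkpoints."""
    if not state_dicts:
        raise ValueError("need at least one checkpoint")
    n = len(state_dicts)
    averaged = {}
    for name in state_dicts[0]:
        # Group the k-th value of this parameter from every checkpoint.
        columns = zip(*(sd[name] for sd in state_dicts))
        averaged[name] = [sum(col) / n for col in columns]
    return averaged


# Toy checkpoints keyed by training step (as with --iter).
checkpoints = {
    2000: {"w": [1.0, 2.0]},
    4000: {"w": [3.0, 4.0]},
    6000: {"w": [5.0, 6.0]},
}
chosen = select_last(list(checkpoints), end=6000, avg=2)
model = average_checkpoints([checkpoints[i] for i in chosen])
print(chosen, model)
```

Selecting by step rather than by epoch gives finer-grained choices on a large dataset, where a single epoch spans many saved checkpoints.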

wgb14 commented 2 years ago

Updated results:

|     | Dev  | Test  |
|-----|------|-------|
| WER | 10.4 | 10.51 |

This RNN-T model outperforms other models.
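
For reference, WER is the word-level edit distance (substitutions + deletions + insertions) between hypothesis and reference, divided by the number of reference words. A minimal sketch follows, assuming whitespace tokenization and no text normalization; the function name `wer` is illustrative and this is not the scoring script used for the numbers above.

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate in percent for a single utterance."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between ref[:i-1] and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        curr = [i]  # deleting all i reference words so far
        for j, h in enumerate(hyp, 1):
            substitution = prev[j - 1] + (r != h)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return 100.0 * prev[-1] / len(ref)


# One substitution (sat -> sit) and one deletion (the) over 6 reference words.
print(round(wer("the cat sat on the mat", "the cat sit on mat"), 2))
```

A corpus-level figure is normally computed by summing errors and reference words over all utterances before dividing, rather than by averaging per-utterance rates.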

chenguoguo commented 2 years ago

Great, merging.