flashlight / wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit
https://github.com/facebookresearch/wav2letter/wiki

Release of online RNN-T baseline #475

Closed swpd closed 4 years ago

swpd commented 4 years ago

Hi,

Thanks for your great work on online speech recognition. I'm also interested in the RNN-T baseline mentioned in the paper and would like to compare it with the TDS model on my own training data. According to Section 4.2 of the paper:

All our experiments are run using wav2letter++ framework [2] with 64 GPUs for each experiment.

Could you please consider releasing the RNN-T baseline (RNN-T train/decode recipes, RNN-T criterion, etc.)?

Thanks!

vineelpratap commented 4 years ago

Hi, for the LC-BLSTM + RNN-T baseline, we got the trained model from the authors of https://arxiv.org/abs/1911.01629 and used it to compute the experimental results. It was not trained in the wav2letter++ framework.