NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
https://docs.nvidia.com/nemo-framework/user-guide/latest/overview.html
Apache License 2.0
11.84k stars 2.46k forks source link

[Question] Is there any plan for ASR Quartznet v3? #1217

Closed ORlGlN closed 4 years ago

ORlGlN commented 4 years ago

Really appreciate what you guys did here, common voice dataset have been updated to en_1932h_2020-06-22, is there any plan to release Quartznet v3 with better performance and wer in near future?

okuchaiev commented 4 years ago

thanks @ORlGlN - we'll have a look at updated MCV. @redoctopus FYI