NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
https://docs.nvidia.com/nemo-framework/user-guide/latest/overview.html
Apache License 2.0
11.84k stars 2.46k forks source link

Wav2letter implementation #1668

Closed lematmat closed 3 years ago

lematmat commented 3 years ago

Describe your question: As Nemo is a Pytorch lightening module (as wav2letter) is there an easy to implement Wav2letter with Nemo ?

A clear and concise description of your question. Describe what you want to achieve. And/or what NeMo APIs are unclear/confusing.

Environment overview (please complete the following information)

Environment details

If NVIDIA docker image is used you don't need to specify these. Otherwise, please provide:

Additional context

Add any other context about the problem here. Example: GPU model

okuchaiev commented 3 years ago

Currently we do not have wav2letter implementation. Checkout Jasper and QuartzNet networks which were inspired by wav2letter

lematmat commented 3 years ago

Ok, thank you.