Python inference to convert wav to text

flashlight / wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

Other

6.37k stars 1.01k forks source link

Python bindings supports only featurization and beam-search decoding where predictions from the network are provided. So that people who trained acoustic models with tensorflow/pytorch could reuse beam-search decoding from python.

Wav2letter is c++ and models trained with it can be used for inference with Decode.cpp binary.

Here we have colab example (all tutorials are here) with recent codebase how one can do inference with the CTC acoustic model and some n-gram language model. So this is binary call with interactive regime, so that you can pass path to audio and transcription will be printed on the screen, and so on.

flashlight / wav2letter

Python inference to convert wav to text #944