stefanpantic / asr

Automatic speech recognition using neural networks
GNU General Public License v3.0
19 stars 1 forks source link
asr automatic-speech-recognition common-voice jasper librispeech machine-learning neural-networks python quartznet tensorflow

ASR - Automatic Speech Recognition

Automatic Speech Recognition using neural networks. This repo contains implementations of NVIDIA's Jasper and QuartzNet speech recognition architectures. Their approach doesn't rely on RNNs like most do, but rather opts for a fully convolutional approach.
NOTE: You'll have to unzip the ffmpeg binaries manually because we had to push them as zip archives due to Git LFS bandwith limitations. Just do Extract Here in their appropriate directories.

The research papers can be found on Arxiv:

:mortar_board: Authors: