hirofumi0810 / tensorflow_end2end_speech_recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
MIT License
313 stars 120 forks source link
asr attention-mechanism automatic-speech-recognition beam-search csj ctc end-to-end end-to-end-learning joint-ctc-attention librispeech speech-recognition speech-to-text tensorflow timit timit-dataset

TensorFlow Implementation of End-to-End Speech Recognition

Requirements

Corpus

TIMIT

LibriSpeech

CSJ (Corpus of Spontaneous Japanese)

These corpuses will be added in the future.

This repository does'nt include pre-processing and pre-processing is based on this repo. If you want to do pre-processing, please look at this repo.

Model

Encoder

Connectionist Temporal Classification (CTC) [Graves+ 2006]

Options

Attention Mechanism

Decoder
Attention type
Options

Usage

Please refer to docs in each corpuse

Lisense

MIT

Contact

hiro.mhbc@gmail.com