Integrate a pre-trained Speech2Vec embedding layer for Speech To Text models

NVIDIA / OpenSeq2Seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

https://nvidia.github.io/OpenSeq2Seq

Apache License 2.0

1.54k stars, 371 forks

Closed: jsvir closed this issue 5 years ago

jsvir commented 5 years ago

1. Train an embedding on all audio data in the training set, enabled by an optional setting in the config file.
2. Use that embedding in the data layer, rather than audio features only.
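
To make the request concrete, here is a minimal sketch of how the two steps could fit together. It is an illustration under stated assumptions, not existing OpenSeq2Seq code: the `speech2vec_embedding` config block, `embed_frames`, and `data_layer_output` are hypothetical names, and a fixed linear projection stands in for a real pre-trained Speech2Vec model.

```python
# Hypothetical sketch: the "speech2vec_embedding" option and both helper
# functions are illustrative assumptions, not part of OpenSeq2Seq.
from typing import Dict, List, Optional

Frame = List[float]

# Config fragment in the style of a data layer parameter dict.
data_layer_params: Dict[str, object] = {
    "num_audio_features": 4,      # size of each raw audio feature frame
    "speech2vec_embedding": {     # hypothetical optional setting
        "enabled": True,
        "dim": 2,                 # size of each embedded frame
    },
}


def embed_frames(frames: List[Frame], projection: List[Frame]) -> List[Frame]:
    """Map each feature frame to an embedding with a fixed matrix.

    `projection` has one row per embedding dimension; it is a stand-in
    for whatever a pre-trained embedding model would compute.
    """
    return [
        [sum(w * x for w, x in zip(row, frame)) for row in projection]
        for frame in frames
    ]


def data_layer_output(
    frames: List[Frame],
    params: Dict[str, object],
    projection: Optional[List[Frame]] = None,
) -> List[Frame]:
    """Return embedded frames if the option is enabled, else raw features."""
    cfg = params.get("speech2vec_embedding") or {}
    if not cfg.get("enabled", False):
        return frames  # default behaviour: audio features only
    if projection is None:
        raise ValueError("embedding enabled but no pre-trained weights given")
    return embed_frames(frames, projection)


# Usage: one frame of four features, projected down to two dimensions.
frames = [[1.0, 2.0, 3.0, 4.0]]
projection = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 1.0]]
embedded = data_layer_output(frames, data_layer_params, projection)
```

Keeping the switch inside the data layer parameters means existing configs that omit the block would keep producing plain audio features.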