NVIDIA / OpenSeq2Seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
https://nvidia.github.io/OpenSeq2Seq
Apache License 2.0
1.54k stars 372 forks source link

Transfer learning #467

Closed flassTer closed 5 years ago

flassTer commented 5 years ago

Is it possible to start transfer learning with more characters in the vocabulary file than the ASR model was already trained on?

blisc commented 5 years ago

We have never tried this. I can only offer some starting points that may or may not work.

Depending on what you are trying to accomplish, it might simply be easier to either hard-code or train a neural LM to convert the model predictions to the desired transcripts. Eg, words -> numbers, dollar -> $, etc.

flassTer commented 5 years ago

Thank you @blisc