gentaiscool / end2end-asr-pytorch

End-to-End Automatic Speech Recognition on PyTorch
MIT License
292 stars 62 forks source link

Train my own dataset for a different language #15

Closed jafarOlamide closed 4 years ago

jafarOlamide commented 4 years ago

I want to train my dataset for a different language (Yoruba), it is also a tonal language.

gentaiscool commented 4 years ago

HI @jafarOlamide, it should work. I ran the model with Chinese, also a tonal language. You can use all Yoruba characters as the vocabulary. Please let me know if you have another questions.

jafarOlamide commented 4 years ago

so I tried training the model on Google colab and I got the following results. But got interrupted, see the image to get a picture of what I'm painting Screenshot from 2019-11-27 05-14-42

jafarOlamide commented 4 years ago

Can you also provide further research papers and articles to better understand the transformer model. Thank you.

gentaiscool commented 4 years ago

You can check the following papers: