Hi. I decided to use Kur to try and train a model based on a Russian-language corpus (about 8k transcribed input utterances). I had to increase the vocab parameter, but other than that, it started training.
However, I wonder whether I need to make further changes to account for the Cyrillic alphabet. I just ran a test evaluation after one epoch, and the gibberish I got was in Latin script.
Any ideas on what changes are required?
Regards, Christo