I am trying to train flashlight ASR for my native language. Before training on full dataset I was trying to set up everything on a very small dataset in Google Colab. The data I am using for setup purposes is just around 1 hour. I have taken architecture files from the wave2letter’s recipes folder. On using architecture file from conv_glu/wsj I run the training for 100 epochs but my WER always remains 100%. Then I tried decoding and it gives no output/prediction for any audio file.
What things can possibly cause this or any suggestions. Do I need to train on more data or I have to change any hyperparameters. In past I have trained models on Kaldi and I know that once a model is trained it gives some text output even though it might be wrong/inaccurate.
Hi,
I am trying to train flashlight ASR for my native language. Before training on full dataset I was trying to set up everything on a very small dataset in Google Colab. The data I am using for setup purposes is just around 1 hour. I have taken architecture files from the wave2letter’s recipes folder. On using architecture file from conv_glu/wsj I run the training for 100 epochs but my WER always remains 100%. Then I tried decoding and it gives no output/prediction for any audio file.
What things can possibly cause this or any suggestions. Do I need to train on more data or I have to change any hyperparameters. In past I have trained models on Kaldi and I know that once a model is trained it gives some text output even though it might be wrong/inaccurate.
Thanks.