flashlight / wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit
https://github.com/facebookresearch/wav2letter/wiki

100% WER while training and no prediction on decoding #1009

Open halts440 opened 2 years ago

halts440 commented 2 years ago

Hi,

I am trying to train flashlight ASR for my native language. Before training on the full dataset, I wanted to get everything set up on a very small dataset (around 1 hour of audio) in Google Colab. I took the architecture files from wav2letter's recipes folder. Using the architecture file from conv_glu/wsj, I ran training for 100 epochs, but the WER always stays at 100%. When I then tried decoding, it produced no output/prediction for any audio file.
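For context on why these two symptoms may be the same problem: WER is normally the word-level edit distance divided by the number of reference words, so a model that emits nothing for every utterance scores exactly 100% (every reference word counts as a deletion). Below is a minimal sketch of that calculation. It is not flashlight's own implementation, and the function name `word_error_rate` is only illustrative.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER in percent: word-level edit distance / reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return 100.0 * prev[-1] / len(ref)


# An empty hypothesis deletes every reference word.
print(word_error_rate("the cat sat", ""))  # 100.0
```

So a WER stuck at exactly 100% during training would be consistent with the network always producing empty transcriptions, which matches what I see at decoding time.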

What could be causing this, and do you have any suggestions? Do I need to train on more data, or do I have to change some hyperparameters? I have trained models with Kaldi in the past, and I know that once a model is trained it gives some text output, even if that output is wrong or inaccurate.

Thanks.