Open kvinwang opened 6 years ago
If you have a slow GPU like me and had to reduce the batch size from 16 to 8 or even 4, keep in mind that training will be slower. I had the same problem, but it was not a Kur problem; it was a matter of patience. I left it training all day, and about 5 hours in, at approximately 8-10 epochs, I started to see some consonants and spaces in the output. Not words, but something was there!! If you let it train longer, it will start outputting some words and the predictions will begin to resemble the original audio.
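To make the batch size point concrete, here is a rough sketch of how the number of weight updates per epoch grows as the batch shrinks. The sample count is a made-up placeholder, not the real size of the Kur speech dataset, and `steps_per_epoch` is just a helper name I picked for illustration.

```python
import math

def steps_per_epoch(num_samples, batch_size):
    """Number of batches needed to see every sample once."""
    return math.ceil(num_samples / batch_size)

# Hypothetical dataset size, for illustration only.
num_samples = 2400

for batch_size in (16, 8, 4):
    steps = steps_per_epoch(num_samples, batch_size)
    print(f"batch_size={batch_size}: {steps} steps per epoch")
```

Halving the batch size doubles the number of steps, and since small batches typically use the GPU less efficiently, each epoch usually takes longer in wall-clock time.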
After taking a closer look at the number of samples per epoch, I see you are training on the default dataset, which is very small, so you won't get a prediction that makes much sense in the end. I also notice that your computer takes more than 4 hours to complete a single epoch, which is TOO slow. I guess you are training on your CPU. You need to use tensorflow-gpu with an appropriate GPU; otherwise you will need weeks or even months of training to obtain intelligible predictions on a bigger dataset.
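If you are not sure whether TensorFlow can actually see your GPU, a quick check along these lines may help. This is only a sketch: it assumes you are using the TensorFlow backend, and `visible_gpus` is a helper name I made up. It returns `None` when TensorFlow is not installed at all.

```python
def visible_gpus():
    """Return the GPU device names TensorFlow can see, or None if TensorFlow is missing."""
    try:
        from tensorflow.python.client import device_lib
    except ImportError:
        return None
    # Each entry describes one local device; keep only the GPUs.
    return [d.name for d in device_lib.list_local_devices()
            if d.device_type == "GPU"]

gpus = visible_gpus()
if gpus is None:
    print("TensorFlow is not installed in this environment.")
elif not gpus:
    print("No GPU visible to TensorFlow; training will run on the CPU.")
else:
    print("GPUs visible to TensorFlow:", gpus)
```

An empty list usually means the CPU-only package is installed or the GPU drivers are not set up correctly.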
I am following the guide and ran `kur -v train speech.yml`. After 3 epochs, the prediction is still empty. What's wrong?