Closed OswaldoBornemann closed 5 years ago
I haven't seen this error. It's strange error - it happens during saving of the model.
I'm using pytorch 1.0.1.post2, cuda 10. Maybe you are running out of memory on the GPU. Maybe try reducing batch size. Also, check that checkpoint path exists and writeable.
@geneing thanks my friend. Now the training is going on.
hi @tsungruihon ! How did you solve the problem? Which version of pytorch and CUDA are you using? thanks !
@maozhiqiang pytorch 1.0.1.post2
and cuda 10
thank you!
@maozhiqiang i noticed you have used Mozilla TTS
on Chinese corpus and get good results. Recently i used Mozilla TTS
too. Could i communicate with you with email ?
@tsungruihon It's a pleasure to talk with you! my email is: z_q_mao@163.com
@maozhiqiang did updating to pytorch 1.0.1.post2
work?
I'm getting the same error on Pytorch 1.1.0
Cuda 10.1
I've decreased the batch size to 64. Got same error. It's not a OOM error, I've got 16GB of memory. Changed checkpoint folders permissions, so shouldn't be a permissions issue. Any other suggestions @geneing
@acrosson @tsungruihon Please try the newly committed code. I fixed a quantization issue which was generating similar error to the one you are seeing. Make sure that the input audio files contain values that are in the range [-1, 1].
thanks a lot.!
geneing notifications@github.com 于2019年5月10日周五 下午12:58写道:
@acrosson https://github.com/acrosson @tsungruihon https://github.com/tsungruihon Please try the newly committed code. I fixed a quantization issue which was generating similar error to the one you are seeing. Make sure that the input audio files contain values that are in the range [-1, 1].
— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/geneing/WaveRNN-Pytorch/issues/2#issuecomment-491156116, or mute the thread https://github.com/notifications/unsubscribe-auth/ACYV6C3BFVEPG2LJMFEQZBLPUT6IFANCNFSM4G5NEFAA .
@tsungruihon did this work for you? I got the same error, even after pulling down the latest code.
I didn't normalize, like @geneing suggested, maybe that's the issue?
@acrosson i haven't tried the latest code, busy on TTS
now. :sob:
When i ran the
train.py
, it just show thatRuntimeError: cuDNN error: CUDNN_STATUS_EXECUTION_FAILED
. But my pytorch version isv1.0.1