chasing-ant closed this issue 3 weeks ago
A similar issue is: https://github.com/X-LANCE/VoiceFlow-TTS/issues/11#issuecomment-2084334819
@chasing-ant This length mismatch is a common phenomenon, and you can overcome this by truncating or padding the features to the same length. In your case, as mel is one frame shorter than durations, the recommended solution is to zero-pad the mel sequence by 1 frame. I am not sure whether the numpy RuntimeWarning will affect the result (intuitively it won't), but at least padding or truncating before training can avoid such warnings.
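A minimal sketch of the padding/truncation described above. It assumes `mel` is a NumPy array of shape `(n_mels, n_frames)` (consistent with the `mel.shape[1]` check in the question) and `dur` is a 1-D array of integer frame counts. The function name `align_mel_to_durations` and the `max_diff` tolerance are hypothetical and not part of the repository.

```python
import numpy as np


def align_mel_to_durations(mel, dur, max_diff=1):
    """Pad or truncate mel along the time axis so its length equals sum(dur).

    mel: array of shape (n_mels, n_frames); dur: 1-D array of frame counts.
    max_diff is an assumed tolerance; larger mismatches raise an error
    because they probably indicate a real feature-extraction problem.
    """
    target = int(np.sum(dur))
    diff = target - mel.shape[1]
    if abs(diff) > max_diff:
        raise ValueError(
            f"mel has {mel.shape[1]} frames but durations sum to {target}"
        )
    if diff > 0:
        # mel is shorter: zero-pad the missing frames at the end
        mel = np.pad(mel, ((0, 0), (0, diff)), mode="constant", constant_values=0.0)
    elif diff < 0:
        # mel is longer: drop the extra trailing frames
        mel = mel[:, :target]
    return mel
```

Applying something like this once during preprocessing, before training, would let the original strict length check stay in place instead of relaxing it.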
I'll give it a try, thank you for your detailed explanation.
Hi, thanks for your great work. I'm having trouble running the following command in the terminal:
python train.py -c configs/lj_16k_gt_dur.yaml -m lj_16k_gt_dur
But an error occurs. I changed this line of code to
abs(sum(dur) - mel.shape[1]) <= 1
and it works, but I don't know whether this affects the result. A numpy RuntimeWarning also appears during operation.