Closed maytusp closed 1 year ago
Hi, maytusp, have you got a possible 16K config? I had tried to change only the sample-rage config and keeps others unchanged, but in the training process ,the evaluation step generated audio are always had bad duration prediction, the speech speed is very slow, and the total audio length of the generated is far more then that of the corresponding GT.
same problem here!
16k Sample Rate Error >
Hi @wizardk, I have the same problem as yours. Do you know what parameters to be adjusted for 16k?
Originally posted by @maytusp in https://github.com/MasayaKawamura/MB-iSTFT-VITS/issues/7#issuecomment-1357225003