Open a897456 opened 8 months ago
One is the ground-truth pitch and the other is the predicted pitch.
Thank you for your reply, @lexkoro. By the way, have you completed the conditional training? And could you share how to generate the prompt and the text, as in the LJSpeech dataset?
I don't think the repository is usable yet.
The first pitch is in sample(), as follows: https://github.com/lucidrains/naturalspeech2-pytorch/blob/659bec7f7543e7747e809e950cc2f84242fbeec7/naturalspeech2_pytorch/naturalspeech2_pytorch.py#L1478-L1479
The second pitch is in the forward() of NaturalSpeech2, as follows: https://github.com/lucidrains/naturalspeech2-pytorch/blob/659bec7f7543e7747e809e950cc2f84242fbeec7/naturalspeech2_pytorch/naturalspeech2_pytorch.py#L1543-L1556