thanks for helping me!
I encountered the difficulty when I do:
Prepare dataset: Download and put statistical files at data/binary/training_set
Prepare path/to/reference_audio (16k): By default, GenerSpeech uses ASR + MFA to obtain the text-speech alignment from reference.
thanks for helping me! I encountered the difficulty when I do:
Prepare dataset: Download and put statistical files at data/binary/training_set Prepare path/to/reference_audio (16k): By default, GenerSpeech uses ASR + MFA to obtain the text-speech alignment from reference.
what is the dataset satisfying the requirement?