Rongjiehuang / GenerSpeech

PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
MIT License
315 stars 45 forks source link

where is the dataset #28

Open YueMing-github opened 11 months ago

YueMing-github commented 11 months ago

thanks for helping me! I encountered the difficulty when I do:

Prepare dataset: Download and put statistical files at data/binary/training_set Prepare path/to/reference_audio (16k): By default, GenerSpeech uses ASR + MFA to obtain the text-speech alignment from reference.

what is the dataset satisfying the requirement?

ahadjawaid commented 10 months ago

Hey were you able to figure this out?