Edresson / YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Other
884 stars 77 forks source link

Training in 24khz Sampling Rate #39

Open chigkim opened 1 year ago

chigkim commented 1 year ago

I'm trying to train VCTK data for 24khz from scratch for hier quality. I set SAMPLE_RATE = 24000 in recipes/vctk/yourtts/train_yourtts.py. However, it loads AudioProcessor in 16k and computes speaker embeddings in 16k. Where is this 16k sample rate hard coded? Also, do I need to train vocoder first before training tts, or tts training does not rely on vocoder? Thanks!