shivammehta25 / Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
https://shivammehta25.github.io/Matcha-TTS/
MIT License
747 stars, 95 forks

Vocoder training #87

Closed by sankar-mukherjee 3 months ago

sankar-mukherjee commented 4 months ago

Thank you for sharing this nice work.

I have a question.

Do I have to train HiFi-GAN from scratch, or can I use a pretrained HiFi-GAN, for example the one from NVIDIA NGC? If I can, do I have to change the audio processing parameters to match HiFi-GAN?

shivammehta25 commented 4 months ago

Just add ckpt_path to the training command: 'python matcha_tts/train.py ckpt_path='

shivammehta25 commented 4 months ago

I've used the pretrained one from the HiFiGAN repo. If its audio parameters differ from yours, I suggest changing your parameters to match the vocoder's.
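One way to act on this advice is to compare the two sets of audio settings before training. The following is a minimal sketch, not code from either repository: the dictionary values are illustrative placeholders, and the key names on both sides (for example `hop_length` versus `hop_size`) are assumptions that should be checked against your own data config and the vocoder's config file.

```python
# Hypothetical example values; read the real ones from your own config files.
# Key names on both sides are assumptions and may differ in your checkout.
tts_audio = {
    "sample_rate": 22050,
    "n_fft": 1024,
    "hop_length": 256,
    "win_length": 1024,
    "n_feats": 80,
    "f_min": 0,
    "f_max": 8000,
}
vocoder_audio = {
    "sampling_rate": 22050,
    "n_fft": 1024,
    "hop_size": 256,
    "win_size": 1024,
    "num_mels": 80,
    "fmin": 0,
    "fmax": 8000,
}

# Maps each acoustic-model key to the vocoder key assumed to mean the same thing.
KEY_MAP = {
    "sample_rate": "sampling_rate",
    "n_fft": "n_fft",
    "hop_length": "hop_size",
    "win_length": "win_size",
    "n_feats": "num_mels",
    "f_min": "fmin",
    "f_max": "fmax",
}


def find_mismatches(tts, vocoder, key_map=KEY_MAP):
    """Return {tts_key: (tts_value, vocoder_value)} for every setting that differs."""
    mismatches = {}
    for tts_key, voc_key in key_map.items():
        tts_value = tts.get(tts_key)
        voc_value = vocoder.get(voc_key)
        if tts_value != voc_value:
            mismatches[tts_key] = (tts_value, voc_value)
    return mismatches


# An empty dict means the mel-spectrogram settings agree.
print(find_mismatches(tts_audio, vocoder_audio) or "audio parameters match")
```

Any key reported here is a setting to change on the acoustic-model side, since the pretrained vocoder's parameters are fixed by its checkpoint.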

sankar-mukherjee commented 4 months ago

Thank you for the clarification on both issues. Another question: can I fine-tune a pretrained model on a small amount of data with this repo? Where should I put the pretrained model path?

shivammehta25 commented 4 months ago

Just change the filelist location in the dataset config and resume training with ckpt_path.
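As a rough illustration of that reply, the sketch below assembles a training command with Hydra-style `key=value` overrides. Only `ckpt_path` and the script path `matcha_tts/train.py` come from this thread. The override names `data.train_filelist_path` and `data.valid_filelist_path`, the helper `build_finetune_command`, and all file paths are assumptions for illustration, so verify them against the data config in your checkout.

```python
import shlex


def build_finetune_command(ckpt_path, train_filelist, valid_filelist,
                           script="matcha_tts/train.py"):
    """Build an argv list that resumes training from a checkpoint on new filelists.

    The script path is the one quoted in the thread; the filelist override
    names are assumed, not confirmed by the thread.
    """
    overrides = [
        f"ckpt_path={ckpt_path}",
        f"data.train_filelist_path={train_filelist}",
        f"data.valid_filelist_path={valid_filelist}",
    ]
    return ["python", script, *overrides]


# Hypothetical paths, used only to show the shape of the command.
cmd = build_finetune_command(
    "checkpoints/pretrained.ckpt",
    "data/my_train_filelist.txt",
    "data/my_val_filelist.txt",
)
print(shlex.join(cmd))  # paste into a shell, or pass cmd to subprocess.run
```

Editing the filelist paths directly in the dataset config, as suggested above, has the same effect as passing them as overrides.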

shivammehta25 commented 3 months ago

I am closing this for now. Feel free to reopen in case of further questions.