as-ideas / ForwardTacotron

⏩ Generating speech in a single forward pass without any attention!
https://as-ideas.github.io/ForwardTacotron/
MIT License
578 stars 113 forks source link

Need instructions for fine tunning #74

Closed kafan1986 closed 2 years ago

kafan1986 commented 2 years ago
  1. I have already trained the TTS using the Fast Pitch implementation with my custom data. Now I want to fine tune the vocoder (Hifigan), which was already trained on the same speaker data. Do I need to perform fine tunning? Would that increase the quality as I believe vocoder will learn to generate audio from not perfect mels (generated from fast pitch generated mels, rather from the ground truth itself).
  2. If yes, can I generate Mels from the Fast Pitch for fine tuning the downstream vocoder? If yes, how to generate the same length and aligned Mels? Need some instructions. Thanks you!