Rayhane-mamah / Tacotron-2

DeepMind's Tacotron-2 Tensorflow implementation
MIT License
2.25k stars 911 forks source link

Point of #495

Open harismeharis opened 3 years ago

harismeharis commented 3 years ago

I have a basic and probably stupid question. What is the point of using a WaveNet Vocoder to invert the mel spectrogram feature representation into time-domain waveform since we can just use audio.inv_mel_spectrogram and get the audio (voice output) directly from the encoder part?