[MB_Melgan] Why is a model trained only generator is better than trained on both?

TensorSpeech / TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

https://tensorspeech.github.io/TensorFlowTTS/

Apache License 2.0

3.76k stars 803 forks source link

[MB_Melgan] Why is a model trained only generator is better than trained on both? #797

Closed ggpid closed 11 months ago

ggpid commented 11 months ago

Why is a model trained only on the generator(200k step) better in quality than a model trained on both the generator and discriminator(1M step)?

I'm training multiband melgan fine tuning with my own dataset which is in korean and about 40munutes and i used kss pretrained model with --pretrained param

trained generator only (200k step) : audio
audio trained generator and discriminator(1M step) : audio