:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Why does a model trained on the generator only (200k steps) sound better than a model trained on both the generator and discriminator (1M steps)?
I'm fine-tuning Multi-band MelGAN on my own Korean dataset, which is about 40 minutes long,
and I used the KSS pretrained model with the --pretrained param.
Trained generator only (200k steps): audio
Trained generator and discriminator (1M steps): audio