Closed skol101 closed 2 years ago
I find that even at 145k steps some even seen voices are subjectively quite far from the source voices.
Also training vocoder from the scratch, now at 140k step.s
It would be best if you found the appropriate parameters to fit your dataset. In my case, I conducted the experiment by fixing 100K.
I find that even at 145k steps some even seen voices are subjectively quite far from the source voices.
Also training vocoder from the scratch, now at 140k step.s