Plachtaa / VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/
MIT License
7.59k stars 756 forks

It's not even nearly close to good. #114

Open furqan4545 opened 11 months ago

furqan4545 commented 11 months ago

I tried to clone many voices, but it failed every time. It just spat out a useless cloned voice, and sometimes it didn't even speak properly.

korakoe commented 11 months ago

This is probably because this VALL-E-X model wasn't trained on the same amount of data, nor for as long. Hopefully someone trains a model on the full Libri-Light dataset soon.

korakoe commented 10 months ago

I've managed to create a fine-tuning Colab on my fork... hopefully I'll get around to training.

makorihi commented 7 months ago

@korakoe did you ever manage to get good quality out of this?