Plachtaa / VITS-fast-fine-tuning

This repo is a VITS fine-tuning pipeline for fast speaker-adaptation TTS and many-to-many voice conversion
Apache License 2.0
4.69k stars, 703 forks

Can 4 minutes of training audio produce good results? #511

Open mikeyang01 opened 9 months ago

mikeyang01 commented 9 months ago

So far I've found that training on 4 minutes of audio gives poor results. Especially on long text, the articulation is very unclear.