RVC-Project / Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!
MIT License
19.71k stars 3.05k forks

Why does my RVC model produce bad results? #2025

Open GUUser91 opened 3 weeks ago

GUUser91 commented 3 weeks ago

I created an RVC model and trained it for 50 epochs, but it produces bad results. I used version v2 and set the target sample rate to 48k. Here is the audio file I used for training: https://files.catbox.moe/440g5r.wav Here is the input file I used: https://vocaroo.com/11ANq5W9xFd5 Here is the output file: https://vocaroo.com/136PewYoZmL3

HotChocut commented 3 weeks ago

Your training data is good, but you need more of it: 40 seconds is not enough to train a good RVC model. Aim for 10 minutes if possible, but the more the merrier. I've done models with upwards of 4 hours' worth of training data, so there really is no such thing as "too much data". I personally would also train with more epochs (around 200). Most of the time I just go with the shotgun approach and do 1000, which is DEFINITELY overkill, and I would not recommend it unless you're crazy like I am.
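
If it helps to check how much audio a dataset actually contains before training, here is a minimal sketch using only Python's standard `wave` module. It is not part of RVC. The function names and the flat folder of `.wav` files are assumptions for illustration, and `wave` reads uncompressed PCM WAV only, so other formats would need converting first. The 10-minute target is the figure suggested above.

```python
import wave
from pathlib import Path

# Assumption: the ~10 minute target suggested in this thread, in seconds.
TARGET_SECONDS = 10 * 60


def wav_duration_seconds(path):
    """Return the length of one PCM WAV file in seconds."""
    with wave.open(str(path), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def dataset_duration_seconds(folder):
    """Sum the durations of all .wav files directly inside `folder`."""
    return sum(wav_duration_seconds(p) for p in sorted(Path(folder).glob("*.wav")))


def report(folder, target=TARGET_SECONDS):
    """Build a one-line summary comparing total audio length to the target."""
    total = dataset_duration_seconds(folder)
    status = "OK" if total >= target else "too short"
    return f"{total:.1f}s of audio ({total / target:.0%} of target): {status}"
```

Calling `print(report("path/to/dataset"))` on the training folder (a placeholder path) gives a quick indication of whether more recordings are needed before starting a long training run.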