RVC-Project / Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!
MIT License
20.62k stars 3.18k forks source link

African Language Phonemes #2057

Open paulerbear opened 1 month ago

paulerbear commented 1 month ago

Hi!

I'm trying to train RVC models for a range of dialects in my country. I live in South Africa.

I'm struggling with click-consonant phonemes.

These African language sounds are not understood as phonemic by the model, so transforming voice-to-voice using recordings of African language speakers results in dropped phonemes or incorrect phonemes.

Click consonants are not reliably preserved in the output and are even occasionally replaced by other consonants ("c", "ck", "k").

Is there anyway for me to overcome this roadblock? Do I need to include more of the click-consonants in my training data?

Click Consonant Transformation.zip

fumiama commented 1 month ago

@RVC-Boss

blaisewf commented 1 month ago

try training the model without pretraineds and adding more data

arbaz686 commented 4 days ago

Add more examples of click consonants in your dataset and modify the tokenizer to treat click consonants as distinct units.