Closed deguodedongxi closed 4 months ago
Yes, you need a multilingual version of style to do that. The issue here is that the PL Bert is just an embedding model for the phonemes, as the model itself is only trained on english speech, you'll need to retrain it on the language you want.
Do you know, if there are any pretrained style models for that? Pretraining a style on a multi lingual corpus from scratch will require a lot of resources, I guess.
As far as i know, there are no multilingual pretrained models, however there are a few models here and there, you have ShoukanLabs that has an english model trained for expressivity https://dagshub.com/ShoukanLabs/Vokan and are currently working on a multilingual one.
Thank you so much! Dagshub's work looks really impressive! I will definitely follow up on their progress!
Hello everyone,
I tried to exchange the pretrained englich BERT model with the multilingual PL-BERT Model to generate speech with the LibriTTS Notebook. For me, the results did not really work out as I expected.
What I did:
global_phonemizer = phonemizer.backend.EspeakBackend(language='de', preserve_punctuation=True, with_stress=True)
orglobal_phonemizer = phonemizer.backend.EspeakBackend(language='fr-fr', preserve_punctuation=True, with_stress=True)
The result still sounds as if the model tries to pronounce the german or french sentence with an english pronounciation.
Did I forget a step to use the BERT Model correctly? Thanks in advance!