CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time
Other
52.18k stars 8.73k forks source link

I trained a German model - which files do I need to share? #571

Closed padmalcom closed 3 years ago

padmalcom commented 3 years ago

Hi I trained encoder, synthesizer and vocoder on German female voices from m-ailabs and even if results are not too good, I'd like to share it. Encoder and vocoder folders aren't too big, but the synthesizer folder is ~10gb. Does anybody know if I can leave some files out or do I have to share the entire 10 gb folder?

ghost commented 3 years ago

Hi @padmalcom , congratulations on completing full stack training and thank you for sharing your German models with the community! For the pretrained models, this is the minimum set of files we distribute. If this is not enough guidance on which files should be selected, please ask.

encoder\saved_models\pretrained.pt
synthesizer\saved_models\logs-pretrained\taco_pretrained\checkpoint
synthesizer\saved_models\logs-pretrained\taco_pretrained\tacotron_model.ckpt-278000.data-00000-of-00001
synthesizer\saved_models\logs-pretrained\taco_pretrained\tacotron_model.ckpt-278000.index
synthesizer\saved_models\logs-pretrained\taco_pretrained\tacotron_model.ckpt-278000.meta
vocoder\saved_models\pretrained\pretrained.pt

If you modified any of the following files, please also include them:

encoder\params_model.py
encoder\params_data.py
synthesizer\hparams.py
synthesizer\utils\cleaners.py
synthesizer\utils\symbols.py
synthesizer\utils\text.py
vocoder\hparams.py
padmalcom commented 3 years ago

Thanks, filesize is now 372mb which is fantastic. Here are the most important points I learned during training. Will upload the archive soon.

First things first

How to run?

Please note that all changes I did were made for windows. You might want to adapt it for linux.

Download

If you just want to apply the model, download the bin.zip from the following release and unzip it to your repository. https://github.com/padmalcom/Real-Time-Voice-Cloning-German/releases/tag/0.1

Training

To train the model replace all code files in the repositlry by the code files from my release. You can leave out the bin.zip file from the release.

Encoder

Synthesizer

Vocoder (You really do not need to train the vocoder, it works well as the pretrained model is - at least for German)

Toolbox

Todos and Learnings

padmalcom commented 3 years ago

Model is here: https://github.com/padmalcom/Real-Time-Voice-Cloning-German/releases/tag/0.1 Please read the README.md for help.

ALIXGUSTAF commented 2 years ago

@padmalcom please can you guide me how is adding Arabic language to [Real-Time-Voice-Cloning] program step by step

joachimschwarzschild commented 1 year ago

@padmalcom i want to clone a german voice for my instagram reels can you make an video how to install it on windows?