myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell.
https://research.myshell.ai/open-voice
MIT License

Tone Color Embedding / Tone Color Converter training #138

Open ulvi95 opened 9 months ago

ulvi95 commented 9 months ago

Dear Developers!

In demo_part1.ipynb, the following is written:

Obtain Tone Color Embedding
The source_se is the tone color embedding of the base speaker. It is an average of multiple sentences generated by the base speaker. We directly provide the result here but the readers feel free to extract source_se by themselves.

source_se = torch.load(f'{ckpt_base}/en_default_se.pth').to(device)

Given that I have both the speaker and the audio files, how can I generate the "default_se.pth" file myself?

Zengyi-Qin commented 9 months ago

Training is currently not supported.
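
For readers with the same question: the notebook text quoted above describes source_se as "an average of multiple sentences generated by the base speaker", so the averaging step can be illustrated separately from any training. The snippet below is only a minimal sketch of that averaging, under the assumption that one embedding per sentence has already been extracted by some other means (for example with the same extraction helper the demo notebook applies to the reference speaker; whether en_default_se.pth was produced that way is not stated in this thread). The function name average_embeddings and the toy vectors are hypothetical, and plain Python lists stand in for tensors so the example runs without any dependencies.

```python
from typing import List, Sequence


def average_embeddings(embeddings: Sequence[Sequence[float]]) -> List[float]:
    """Return the element-wise mean of equally sized embedding vectors.

    Hypothetical helper: it only illustrates the "average of multiple
    sentences" idea from the notebook, not OpenVoice's actual code.
    """
    if not embeddings:
        raise ValueError("need at least one embedding")
    dim = len(embeddings[0])
    if any(len(e) != dim for e in embeddings):
        raise ValueError("all embeddings must have the same length")
    n = len(embeddings)
    # Average each coordinate across all per-sentence embeddings.
    return [sum(e[i] for e in embeddings) / n for i in range(dim)]


# Toy per-sentence embeddings (made-up values, 3 dimensions for readability).
per_sentence = [
    [1.0, 2.0, 3.0],
    [3.0, 4.0, 5.0],
]
source_se = average_embeddings(per_sentence)
print(source_se)  # [2.0, 3.0, 4.0]
```

With real per-sentence embeddings held as torch tensors of identical shape, the same idea would be torch.stack(list_of_tensors).mean(0), and the result could be written out with torch.save so that it loads the same way as the torch.load call quoted in the question. The shape and device the converter expects are not confirmed here and would need to be checked against the provided en_default_se.pth.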