Closed Ca-ressemble-a-du-fake closed 1 year ago
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look our discussion channels.
I can't replicate the issue using the VCTK glow-tts recipe with the latest version. It shows 109 speakers in the resulting congif.json as it should be.
Describe the bug
Hi,
I am following the Multispeaker training documentation on VITS model. So I added the lines
I also checked the value of
config.num_speakers
which was correct (4 speakers) but then the generated config.json shows"num_speakers": 0
whereas it should be 4. Moreover in the end when synthetizing speech it does not list any speakers.To Reproduce
Take a VITS recipe, set 4 datasets in LJSpeech layout with a custom formatter to retrieve a column with the speaker name as:
Add a speaker manager, print the reported number of speakers found (it should be 4), then launch the training.
Navigate to the output path and open the generated config.json. Scroll down to
num_speakers
field, it is 0 instead of 4.Expected behavior
num_speakers
in the generated config.json file should be set in accordance to what is set in python file (aka recipe).Otherwise if
config.num_speakers
(in python) is only used for some prior computation and discarded afterwards then it should be stated somewhere : " X speakers detected but setting num_speakers to 0 in generated config.json because [eg you need to provide d vector file]"Logs
No response
Environment
Additional context
No response