Closed OscarVanL closed 4 years ago
https://github.com/TensorSpeech/TensorFlowTTS/blob/master/examples/fastspeech2_libritts/fastspeech2_dataset.py#L119 => here you should load mapper from your processor mapper (as processors were added later this isn't done in master branch feel free to fix it :P ) Tmp fix is to just use this subfunction as a mapper of speakers.
OK, so if I'm understanding correctly, when you process the dataset it creates libritts_mapper.json
, but then when training the model this is not used and speaker IDs are re-generated, therefore not matching those in libritts_mapper.json
?
OK, so if I'm understanding correctly, when you process the dataset it creates
libritts_mapper.json
, but then when training the model this is not used and speaker IDs are re-generated, therefore not matching those inlibritts_mapper.json
?
Ye when u train from example folder you are not loading mapper from processor (as u can see in #TODO all over the place :P)
@GavinStein1
Hi, just letting you know what I told you in https://github.com/TensorSpeech/TensorFlowTTS/issues/296#issuecomment-723727913 about the speaker_id
was not correct, due to this bug. I've made a PR which seems to fix it though :)
Hi,
I'm having some difficulties with inference on my FastSpeech2 model. I do not think the speaker IDs at inference correlate with those in
libritts_mapper.json
.For example, my speakers_map says
"2999": 65
. Speaker 2999 is British male, but when doing inference with speaker_id 65, the voice is one of a Female American speaker instead.My speakers_map says
"8382": 49
, Speaker 8382 is a British female, but when doing inference with speaker_id 49 I get an American MaleIf I use my own speaker
"1": 50
, I get a British sounding female. The speaker is made from recordings of my own voice (male), so this is also incorrect.I think somehow the speaker IDs are getting mixed up. Is this something you've seen before?