jaywalnut310 / glow-tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search
MIT License
660 stars, 151 forks

Runtime Error: Multi speaker #56

Open ppanja opened 3 years ago

ppanja commented 3 years ago

Hi @jaywalnut310, I have the following multi-speaker configuration and am getting the error below. I have made the other necessary changes described in other issues here, such as switching to TextMelSpeakerLoader and TextMelSpeakerCollate and correcting the batch unpacking.

RuntimeError: Given groups=1, weight of size [256, 512, 3], expected input[1, 256, 89] to have 512 channels, but got 256 channels instead

Multi-speaker configuration:

"batch_size": 64,
"hidden_channels": 256,
"filter_channels": 1024,
"filter_channels_dp": 256,
"n_speakers": 24,
"gin_channels": 256,
"hidden_channels_enc": 256,
"hidden_channels_dec": 256

Training records: 9102. Validation records: 100.

Could you please help me solve this? Please let me know if you would suggest a different configuration.

Thanks in advance.

Regards, Prasanta
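Editor's sketch for readers hitting the same message: the shapes in the error can be read off directly, because a PyTorch Conv1d weight is laid out as (out_channels, in_channels / groups, kernel_size). The helper names below (`conv1d_expected_in_channels`, `explain_mismatch`) are hypothetical and use only the numbers posted above. The final comparison rests on an assumption that the thread does not confirm: that the failing layer was built for `hidden_channels + gin_channels` inputs, so it would receive only 256 channels if the speaker embedding never reached it.

```python
def conv1d_expected_in_channels(weight_shape, groups=1):
    """Return the input channel count a Conv1d weight expects.

    Conv1d weights have shape (out_channels, in_channels // groups, kernel_size).
    """
    out_channels, in_per_group, kernel_size = weight_shape
    return in_per_group * groups


def explain_mismatch(weight_shape, input_shape, groups=1):
    """Compare the channels a layer expects with the channels it received."""
    expected = conv1d_expected_in_channels(weight_shape, groups)
    batch, actual, length = input_shape  # input layout is (batch, channels, time)
    return {
        "expected_in_channels": expected,
        "actual_in_channels": actual,
        "missing_channels": expected - actual,
    }


# Shapes copied from the error message in this issue.
report = explain_mismatch((256, 512, 3), (1, 256, 89))

# Values copied from the posted configuration.
hidden_channels = 256
gin_channels = 256

# ASSUMPTION (not confirmed in the thread): the failing layer takes the text
# encoding concatenated with the speaker embedding along the channel axis.
speaker_conditioned_width = hidden_channels + gin_channels

print(report)
print(speaker_conditioned_width == report["expected_in_channels"])
```

If that assumption holds, the thing to check is whether the speaker IDs from the batch are actually passed to the model on every call, including any initialization pass that runs before the training loop.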

ankurdhuriya commented 2 years ago

I have made the necessary changes to run in a multi-speaker setting: https://github.com/ankurdhuriya/glow-tts