Using Bottleneck features

CookiePPP / cookietts

[Last Updated 2021] TTS from Cookie. Messy and experimental!

BSD 3-Clause "New" or "Revised" License

43 stars 8 forks source link

Using Bottleneck features #35

Closed kannadaraj closed 3 years ago

kannadaraj commented 3 years ago

In the hparams file you have mentioned that use_memory_bottleneck=True,# False baseline. You have indicated that for training a baseline we should set it to false there by resulting in repetition of axillary features to every spectrogram frame. Just wanted to be sure if i need to set to false for baseline training? Thanks for the help.

CookiePPP commented 3 years ago

False would match original tacotron2. The feature is meant to decrease VRAM usage by mixing the speaker embeddings and torchMoji embeddings in the encoder section of the model and decreasing the dimension of the input to the decoder.

kannadaraj commented 3 years ago

Thanks for the comment. Will keep it true.