acetylSv / GST-tacotron

Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09017.pdf)

How to pass to multi-head attention? #6

Open anupam456 opened 5 years ago

anupam456 commented 5 years ago

Hi,

For the condition_on_audio=False case, how should style_emb be computed? What should the GST tokens be?