An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" https://arxiv.org/abs/1810.11960
BSD 3-Clause "New" or "Revised" License
114
stars
32
forks
source link
Remove unnecessary context concatenation of attention at pre-net #19
https://github.com/nii-yamagishilab/tacotron2/pull/12
Affects ExtendedTacotronV1Model and DualSourceSelfAttentionTacotronModel