Open rishikksh20 opened 5 years ago
Sorry, I am on the Spring Festival holiday, and it is a little late to see the news, I use encoder_state as the output of reference_encoder, but the paper did not specify whether to use state or output, this need to be verified by experiments, and "On paper author mentioned on ReferenceEncoder
(section 2.2 ) the output of the GRU layers passed through two separate Fully connected layers," you can find implementation on here:
https://github.com/yanggeng1995/vae_tacotron/blob/b0288f1caa776a98195dd94d1e8ea7ca6ec05f57/models/modules.py#L5-L20
@yanggeng1995 On paper author mentioned on
ReferenceEncoder
(section 2.2 ) the output of the GRU layers passed through two separate Fully connected layers, but in this implementation, last GRU state passed to two separate FC layerAs you see
encoder_state
return instead ofencoder_output
. On the other author mentioned in the same section that they used samereference_encoder
as used in GST tacotron , and I go through the best gst-tacotron implementation on github i.e. https://github.com/syang1993/gst-tacotron here alsoreference encoder
returnedencoder_output
and it working fine.Interesting thing is that on GST tacotron paper author mentioned to use last GRU state as the reference embedding. Please take note and clarify weather to take
encoder_output
orencoder_state
as the output ofreference_encoder
.Thanks