auspicious3000 / autovc

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
https://arxiv.org/abs/1905.05879
MIT License
996 stars 207 forks source link

About decoder mismatch #7

Closed wade3han closed 5 years ago

wade3han commented 5 years ago

Hi, I have a question about the decoder mismatch between code and the paper.

In the paper, it used 3 LSTM layers after 3 convnorm layers. However in the code, it used 1 LSTM, 3 convnorm layers are following after, and 2 LSTM layers are used next.

Is this mismatch intended? I am wondering which implementation is correct. Thanks.

auspicious3000 commented 5 years ago

Code is more up-to-date