Hi, I have a question about a mismatch in the decoder between the code and the paper.
In the paper, the decoder uses 3 convnorm layers followed by 3 LSTM layers.
In the code, however, it uses 1 LSTM layer first, then 3 convnorm layers, and then 2 more LSTM layers.
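To make the two orderings concrete, here is a minimal PyTorch sketch of how I read each one. The class names, the `conv_norm_stack` helper, and all dimensions are placeholders of mine, not taken from the repository or the paper, and convnorm is approximated here as Conv1d + BatchNorm1d + ReLU.

```python
import torch
from torch import nn


def conv_norm_stack(channels: int, n_layers: int = 3) -> nn.Sequential:
    # Hypothetical stand-in for the convnorm layers: Conv1d + BatchNorm1d + ReLU.
    # Kernel size and padding are assumptions chosen to preserve sequence length.
    layers = []
    for _ in range(n_layers):
        layers += [
            nn.Conv1d(channels, channels, kernel_size=5, padding=2),
            nn.BatchNorm1d(channels),
            nn.ReLU(),
        ]
    return nn.Sequential(*layers)


class PaperOrderDecoder(nn.Module):
    """Ordering as described in the paper: 3 convnorm layers, then 3 LSTM layers."""

    def __init__(self, in_dim: int = 32, hidden: int = 64):
        super().__init__()
        self.convs = conv_norm_stack(in_dim)
        self.lstm = nn.LSTM(in_dim, hidden, num_layers=3, batch_first=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, time, in_dim); Conv1d expects (batch, channels, time).
        x = self.convs(x.transpose(1, 2)).transpose(1, 2)
        out, _ = self.lstm(x)
        return out


class CodeOrderDecoder(nn.Module):
    """Ordering as found in the code: 1 LSTM, 3 convnorm layers, then 2 LSTM layers."""

    def __init__(self, in_dim: int = 32, hidden: int = 64):
        super().__init__()
        self.lstm1 = nn.LSTM(in_dim, hidden, num_layers=1, batch_first=True)
        self.convs = conv_norm_stack(hidden)
        self.lstm2 = nn.LSTM(hidden, hidden, num_layers=2, batch_first=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, time, in_dim)
        x, _ = self.lstm1(x)
        x = self.convs(x.transpose(1, 2)).transpose(1, 2)
        out, _ = self.lstm2(x)
        return out


if __name__ == "__main__":
    x = torch.randn(2, 10, 32)  # (batch, time, features), made-up sizes
    print(PaperOrderDecoder()(x).shape)
    print(CodeOrderDecoder()(x).shape)
```

Both variants have 3 LSTM layers and 3 convnorm layers in total; the only difference is whether the first LSTM layer runs before or after the convolutions.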
Is this mismatch intended? I am wondering which implementation is correct.
Thanks.