farizrahman4u / seq2seq

Sequence to Sequence Learning with Keras

Bug in AttentionDecoderCell? #248

Open jjwangnlp opened 6 years ago

jjwangnlp commented 6 years ago

Dear authors, I have three questions as follows:

Firstly,

https://github.com/farizrahman4u/seq2seq/blob/c020ccfc1fa3a651be272f8b4be48a10f9c3f0fa/seq2seq/cells.py#L87

At line 87, the output shape is given as output_shape=(input_dim + hidden_dim,). Should it instead be output_shape=(input_length, input_dim + hidden_dim)?
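To make the question concrete, here is a minimal shape trace of what I believe happens around line 87 (plain numpy standing in for K.reshape; the dimension sizes are made up):

```python
import numpy as np

batch, input_length, input_dim, hidden_dim = 2, 5, 3, 4  # made-up sizes

# _xC just before line 87: x concatenated with C along the last axis
_xC = np.zeros((batch, input_length, input_dim + hidden_dim))

# the K.reshape in line 87 folds the time axis into the batch axis
flat = _xC.reshape((-1, input_dim + hidden_dim))
print(flat.shape)  # (10, 7) == (batch * input_length, input_dim + hidden_dim)
```

So the question comes down to whether output_shape should describe the per-sample shape after this reshape, (input_dim + hidden_dim,), or the 3-D shape before it, (input_length, input_dim + hidden_dim).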

Additionally, there is the same issue as in #219.

instead of this:

https://github.com/farizrahman4u/seq2seq/blob/c020ccfc1fa3a651be272f8b4be48a10f9c3f0fa/seq2seq/cells.py#L85

shouldn't it be this (input_dim -> hidden_dim):

C = Lambda(lambda x: K.repeat(x, input_length), output_shape=(input_length, hidden_dim))(c_tm1)

The reason is that c_tm1 has dimensionality hidden_dim, not input_dim.
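A quick shape check seems to confirm this (using tf.keras here as a stand-in for the older Keras backend the repo targets; the sizes are made up):

```python
import tensorflow as tf
from tensorflow.keras import backend as K

batch, input_length, hidden_dim = 2, 5, 4  # made-up sizes

# c_tm1 is the previous cell state, so its last dimension is hidden_dim
c_tm1 = K.zeros((batch, hidden_dim))

# the K.repeat in line 85 tiles it along a new time axis
C = K.repeat(c_tm1, input_length)
print(K.int_shape(C))  # (2, 5, 4) == (batch, input_length, hidden_dim)
```

So, as far as I can tell, the declared output_shape should be (input_length, hidden_dim) rather than (input_length, input_dim).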

Finally,

https://github.com/farizrahman4u/seq2seq/blob/c020ccfc1fa3a651be272f8b4be48a10f9c3f0fa/seq2seq/cells.py#L89

Line 89 reads alpha = W3(_xC). Does Dense play the same role here as TimeDistributedDense in Keras?
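My guess is yes: because _xC was flattened to 2-D in line 87, Dense is applied to every timestep independently, which is what TimeDistributedDense (TimeDistributed(Dense(...)) in newer Keras) does on a 3-D tensor. A small check of that equivalence, again with tf.keras and made-up sizes:

```python
import tensorflow as tf
from tensorflow.keras.layers import Dense, TimeDistributed

batch, input_length, dim = 2, 5, 7  # made-up sizes
x3d = tf.random.uniform((batch, input_length, dim))

dense = Dense(1)

# Dense applied to the tensor flattened as in line 87
flat_out = dense(tf.reshape(x3d, (-1, dim)))   # (batch * input_length, 1)

# the same layer wrapped in TimeDistributed, applied to the 3-D tensor
td_out = TimeDistributed(dense)(x3d)           # (batch, input_length, 1)

# both compute the same per-timestep projection
print(tf.reduce_max(tf.abs(tf.reshape(td_out, (-1, 1)) - flat_out)).numpy())  # 0.0
```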