ytoon closed this issue 7 years ago
I have a question about LSTM weight sharing along the time dimension.
https://github.com/karpathy/char-rnn/blob/master/model/LSTM.lua#L30 https://github.com/karpathy/char-rnn/blob/master/model/LSTM.lua#L31
The code at these lines seems to allocate two new Linear layers for each time step. That looks inconsistent with the LSTM mechanism, in which the weights are shared along the time dimension.
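To make clear what I mean by sharing weights along the time dimension, here is a minimal sketch in plain Python (not Lua/Torch, and not a reproduction of LSTM.lua). All names in it (`make_linear`, `lstm_step`, `unroll`, `count_params`) are hypothetical, and the gate ordering is an arbitrary choice for illustration. The two linear maps, input-to-hidden and hidden-to-hidden, are allocated once and then reused at every time step. If the loop at the linked lines iterates over stacked layers rather than time steps (an assumption I have not verified), then one pair of Linear layers per iteration would match this picture.

```python
import math
import random


def make_linear(n_in, n_out, rng):
    """Allocate one weight matrix (list of rows) and one bias vector."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    bias = [0.0] * n_out
    return weights, bias


def linear(layer, x):
    """Apply an affine map W x + b."""
    weights, bias = layer
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def lstm_step(params, x, prev_h, prev_c):
    """One LSTM time step; params holds the (i2h, h2h) pair of linear maps."""
    i2h, h2h = params
    n = len(prev_h)
    # Both maps produce 4 * n pre-activations, one chunk of n per gate.
    sums = [a + b for a, b in zip(linear(i2h, x), linear(h2h, prev_h))]
    in_gate = [sigmoid(s) for s in sums[0:n]]
    forget_gate = [sigmoid(s) for s in sums[n:2 * n]]
    out_gate = [sigmoid(s) for s in sums[2 * n:3 * n]]
    candidate = [math.tanh(s) for s in sums[3 * n:4 * n]]
    next_c = [f * c + i * g
              for f, c, i, g in zip(forget_gate, prev_c, in_gate, candidate)]
    next_h = [o * math.tanh(c) for o, c in zip(out_gate, next_c)]
    return next_h, next_c


def unroll(params, inputs, hidden_size):
    """Run the same params over every time step of the input sequence."""
    h = [0.0] * hidden_size
    c = [0.0] * hidden_size
    for x in inputs:  # loop over time: no new weights are allocated here
        h, c = lstm_step(params, x, h, c)
    return h, c


def count_params(params):
    """Total number of scalar weights and biases in the (i2h, h2h) pair."""
    return sum(len(w) * len(w[0]) + len(b) for w, b in params)


if __name__ == "__main__":
    rng = random.Random(0)
    input_size, hidden_size = 3, 2
    # Allocated once, outside the time loop.
    params = (make_linear(input_size, 4 * hidden_size, rng),
              make_linear(hidden_size, 4 * hidden_size, rng))
    sequence = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
    h, c = unroll(params, sequence, hidden_size)
    print(h, c, count_params(params))
```

In this sketch the parameter count depends only on the input and hidden sizes, not on the sequence length, which is the property I would expect an unrolled LSTM to have.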