Normally, when decoding with an LSTM, the output at time t-1 is used as the input at time t. Instead, I want the input at time t to be [output{t-1} f(output{t-1})], where f is feedforward neural network. Does anyone know how to cleanly incorporate this into the code?
Normally, when decoding with an LSTM, the output at time t-1 is used as the input at time t. Instead, I want the input at time t to be [output{t-1} f(output{t-1})], where f is feedforward neural network. Does anyone know how to cleanly incorporate this into the code?