Hidden states of the model/rnn

Hello,

I was wondering whether there is a reason why the dual path extension (DPE) modules do not use the hidden states from the previous DPE, and instead all of them are initialized to zero. In other words, why not zero-initialize only the first DPE and pass its out_state to the second DPE as its input hidden state?

Also, is the out_hidden_state then reused in each batch or in each epoch? I was wondering where you actually run this initialization: "in_hidden_state = [[torch.zeros(1, batch * num_bands, inter_hiddensize//groups) for _ in range(groups)] for _ in range(num_modules)]"
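
To make the question concrete, here is a minimal sketch of the two options I have in mind. It is not the FSPEN code: the plain GRU stand-ins, the `run_modules` and `zero_states` helpers, and all sizes are my own assumptions, used only to show where the hidden states could be reset or carried over.

```python
import torch
from torch import nn

# Hypothetical sizes, chosen only for illustration (not taken from FSPEN).
batch, num_bands, inter_hiddensize = 2, 4, 16
groups, num_modules, seq_len = 2, 3, 5
hidden = inter_hiddensize // groups


def zero_states():
    """Fresh zero hidden states: one list of per-group tensors per module."""
    return [
        [torch.zeros(1, batch * num_bands, hidden) for _ in range(groups)]
        for _ in range(num_modules)
    ]


# Stand-in for the grouped RNNs inside each DPE module (assumption).
modules = nn.ModuleList([
    nn.ModuleList([nn.GRU(hidden, hidden, batch_first=True) for _ in range(groups)])
    for _ in range(num_modules)
])


def run_modules(x, in_states):
    """Run every module on x; each module uses its own entry of in_states."""
    out_states = []
    for module, module_states in zip(modules, in_states):
        chunks = torch.chunk(x, groups, dim=-1)  # split features per group
        outs, new_states = [], []
        for rnn, chunk, h in zip(module, chunks, module_states):
            out, h_new = rnn(chunk, h)
            outs.append(out)
            new_states.append(h_new)
        x = torch.cat(outs, dim=-1)
        out_states.append(new_states)
    return x, out_states


x = torch.randn(batch * num_bands, seq_len, inter_hiddensize)

# Option A: reset every module to zeros at the start of each batch.
y_reset, states = run_modules(x, zero_states())

# Option B: carry the returned states into the next chunk or batch,
# detached so gradients do not flow across the boundary.
carried = [[h.detach() for h in module_states] for module_states in states]
y_carried, states_next = run_modules(x, carried)
```

With option A every batch starts from the same zero state, while option B makes the output depend on what was processed before, which is what I would expect for streaming inference.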

Thank you.

gitwukeyi / FSPEN
