Matt suggested to stack some seq2seq encoders to give the model more depth; this does so in a similar manner to the BiDAF code.
There's a funky error going on right now wrt masking the 4D input to the seq2seq encoder, and I can't quite figure it out. It complains that the mask is 2d, and the input itself is 4d...perhaps we need a EncoderWrapper specialized for seq2seq_encoders?
Matt suggested to stack some seq2seq encoders to give the model more depth; this does so in a similar manner to the BiDAF code.
There's a funky error going on right now wrt masking the 4D input to the seq2seq encoder, and I can't quite figure it out. It complains that the mask is 2d, and the input itself is 4d...perhaps we need a
EncoderWrapper
specialized forseq2seq_encoders
?