ryankiros / skip-thoughts

Sent2Vec encoder and training code from the paper "Skip-Thought Vectors"

Github code (decoder GRU layer) different from paper #8

Open FredericGodin opened 8 years ago

FredericGodin commented 8 years ago

Hi,

I'm stepping through the code and noticed that the GRU layer of the decoder doesn't take the context provided by the encoder into account. As stated in the paper (eqs. 5, 6 and 7), an extra parameter is added to incorporate the context at every time step during decoding.
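For reference, this is how I read the conditioned decoder GRU (notation reconstructed, so please double-check against the paper); `h_i` is the encoder output and the `C` matrices are the extra parameters I mean:

```latex
\begin{aligned}
r^t &= \sigma(W_r^d\, x^{t-1} + U_r^d\, h^{t-1} + C_r\, h_i) \\
z^t &= \sigma(W_z^d\, x^{t-1} + U_z^d\, h^{t-1} + C_z\, h_i) \\
\bar{h}^t &= \tanh\!\left(W^d\, x^{t-1} + U^d\,(r^t \odot h^{t-1}) + C\, h_i\right) \\
h^t &= (1 - z^t) \odot h^{t-1} + z^t \odot \bar{h}^t
\end{aligned}
```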

I think the code is missing the following function (taken from the neural machine translation example): https://github.com/kyunghyuncho/dl4mt-material/blob/master/session1/nmt.py#L352
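In plain NumPy (not the repo's Theano code), the kind of step I mean would look roughly like this; the parameter names (`Wr`, `Ur`, `Cr`, ...) are just placeholders for the decoder weights and the extra context matrices:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gru_cond_step(x_prev, h_prev, ctx, p):
    """One decoder GRU step that also conditions on the encoder output `ctx`
    through the extra C-matrices (the terms a plain GRU layer doesn't have)."""
    r = sigmoid(x_prev @ p['Wr'] + h_prev @ p['Ur'] + ctx @ p['Cr'] + p['br'])
    z = sigmoid(x_prev @ p['Wz'] + h_prev @ p['Uz'] + ctx @ p['Cz'] + p['bz'])
    h_bar = np.tanh(x_prev @ p['W'] + (r * h_prev) @ p['U'] + ctx @ p['C'] + p['b'])
    return (1.0 - z) * h_prev + z * h_bar
```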

I can add it myself, but I'm afraid I would miss something and therefore wouldn't be able to reproduce the same results, especially given that training takes about two weeks...

Best,

Fréderic

ryankiros commented 8 years ago

Hi Fréderic,

The pre-trained model on the main page was trained by conditioning on the encoder output at each time step (namely, it used the "gru_cond_simple_layer"). In the current training code, the encoder output directly initializes the decoder's initial state (as in the "Sequence to Sequence Learning..." paper). I didn't notice any performance difference between the two, and it saves almost 200 lines of code.
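Roughly, the difference is the following (sketched in NumPy rather than the actual Theano code; the `Wi`/`bi` init projection here is only illustrative):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def init_decoder_state(ctx, Wi, bi):
    """Current training code: the encoder output only sets the decoder's
    initial hidden state (illustrative projection), after which a plain GRU
    runs with no per-step context term."""
    return np.tanh(ctx @ Wi + bi)

def gru_step(x_prev, h_prev, p):
    """Plain (unconditioned) GRU step used for all subsequent time steps."""
    r = sigmoid(x_prev @ p['Wr'] + h_prev @ p['Ur'] + p['br'])
    z = sigmoid(x_prev @ p['Wz'] + h_prev @ p['Uz'] + p['bz'])
    h_bar = np.tanh(x_prev @ p['W'] + (r * h_prev) @ p['U'] + p['b'])
    return (1.0 - z) * h_prev + z * h_bar
```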

See https://github.com/ryankiros/skip-thoughts/blob/master/training/model.py#L86, where dec_ctx (the encoder output) is used to initialize the state. Sorry, perhaps I should make this clear somewhere in the readme.

Ryan