suriyadeepan / practical_seq2seq

A simple, minimal wrapper for tensorflow's seq2seq module, for experimenting with datasets rapidly
http://suriyadeepan.github.io/2016-12-31-practical-seq2seq/
GNU General Public License v3.0
570 stars 270 forks

Unable to train the model #51

Open Dhrumil1808 opened 6 years ago

Dhrumil1808 commented 6 years ago

I trained your model for around 20,000 iterations on a GPU, but when I tested it on both the test data and unseen data, it always prints "I don't know". Can someone please help me out?

PedroPei commented 6 years ago

You've hit a common problem in seq2seq learning. In my experience, it's probably because the learning rate is too small for the model to get out of the 'trap' of giving the same response every time. Try a larger learning rate, such as 0.2 or even 1.
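One way to check whether a larger learning rate actually helps is to measure how often the single most common reply appears in the decoded output for each rate you try. The sketch below is only an illustration and does not use this repository's API: `most_common_share`, `sweep_learning_rates`, and the `train_and_decode` callable are hypothetical names, and `train_and_decode(lr)` is assumed to train a model with the given learning rate and return a list of decoded reply strings.

```python
from collections import Counter


def most_common_share(responses):
    """Return (reply, share) for the most frequent decoded reply."""
    if not responses:
        raise ValueError("responses must not be empty")
    # Normalise case and surrounding whitespace so trivial variants count together.
    counts = Counter(r.strip().lower() for r in responses)
    reply, n = counts.most_common(1)[0]
    return reply, n / len(responses)


def sweep_learning_rates(learning_rates, train_and_decode):
    """Call train_and_decode(lr) for each rate; map lr to the top reply's share.

    train_and_decode is a hypothetical user-supplied callable, not part of
    this repository: it should train with learning rate lr and return a
    list of decoded reply strings.
    """
    results = {}
    for lr in learning_rates:
        _, share = most_common_share(train_and_decode(lr))
        results[lr] = share
    return results


if __name__ == "__main__":
    # Stand-in for a real training run; the replies are made up for illustration.
    def fake_train_and_decode(lr):
        if lr < 0.01:
            return ["I don't know"] * 4
        return ["hello", "I don't know", "fine, thanks", "see you"]

    print(sweep_learning_rates([0.0001, 0.2], fake_train_and_decode))
```

A share close to 1.0 means the model has collapsed to a single reply for that learning rate. A clearly lower share suggests the replies are more varied, though it says nothing about whether they are good.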

asherabraham commented 4 years ago

What is the code to train the model?