Open pathway opened 6 years ago
I'm so sorry for my late reply... I didn't notice your post.
Thank you very much for your advice! I'm in trouble because MERLIN is too complex to make converged. Trying to the simpler agents seems nice!
I'm busy at a graduate school entrance exam but it will end soon:) I'll try to the implementation after it!
Thank you yosider for sharing this very interesting repo!
Would you consider implementing simpler agents "RL-LSTM" and/or "RL-Mem" from the paper? They would make good baselines and are simpler to follow.