devsisters / DQN-tensorflow

Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning
MIT License

Cannot reproduce the experiment shown in the figure #17

Open zcyang opened 7 years ago

zcyang commented 7 years ago

Hi, can you share a configuration that reproduces the results shown in the figure? I ran the default M1 configuration and only get an average episodic reward of around 3.

I tried changing the configuration, for example setting action_repeat = 4, changing learning_rate, and adding double_q and duel_q, but the results did not change much.

Many thanks!

carpedm20 commented 7 years ago

I think a commit from before Dec 2016 is causing the problem. I'll dig into this; it is closely related to #21.

zcyang commented 7 years ago

Any update on this?