hunkim / ReinforcementZeroToAll

249 stars 132 forks source link

refactor: DQN docstring & fix divergence #21

Closed kkweon closed 7 years ago

kkweon commented 7 years ago
  1. Make it as close as implemented in the paper
  2. Fix slow iteration (tf.Operation inside a loop is too slow)
  3. Clean up codes & remove api_key

Result

2013 ver Game Cleared within 287 episodes with avg reward 199.18

2015 ver Game Cleared in 265 episodes with avg reward 199.39