carpedm20 / deep-rl-tensorflow

TensorFlow implementation of Deep Reinforcement Learning papers
MIT License
1.59k stars 396 forks

Small installation instructions enhancement #4

rhaps0dy closed 8 years ago

rhaps0dy commented 8 years ago

Plus minor nitpicks and likely bugs.

Also, I am working on an implementation of Async Actor-Critic, which I read somewhere you have in progress. If you're also working on it now, please tell me so we can coordinate and avoid duplicate work.

carpedm20 commented 8 years ago

@rhaps0dy Thanks for the fix. Yes, I'm working on A3C, and you can see the code at https://github.com/devsisters/async-rl-tensorflow. But before finishing it, I had a hard time debugging this repo and https://github.com/devsisters/DQN-tensorflow/ to add Double Q-learning and a Dueling network, and to set up an environment that is exactly the same as in the Nature paper.

There is a non-convergence issue at https://github.com/devsisters/DQN-tensorflow/ which contains:

I created multiple repos, but this repo, https://github.com/carpedm20/deep-rl-tensorflow, is the cleanest and my favorite. So you are interested in implementing A3C? Is there any code you have written so far?

rhaps0dy commented 8 years ago

Hello,

Well, I'm glad this is the cleanest repo. I wrote an implementation of A3C, but it works really badly. So far the code is only in my fork of this repo, https://github.com/rhaps0dy/deep-rl-tensorflow/. If you run the debugging command from the README, it learns, but slowly, and it doesn't get much reward. Note the stupidly small learning rate. Perhaps you can identify what is wrong?

What do you mean, a non-convergence issue? In this repo? It seems to work fine for me.