germain-hug / Deep-RL-Keras

Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)
528 stars 149 forks source link

DDPG - LunarLanderContinuous #15

Open cevans3098 opened 5 years ago

cevans3098 commented 5 years ago

Great work on the implementation - very clean code and easy to follow

I have been running the LunarLanderContinuous Environment

python main.py --type DDPG --env LunarLanderContinuous-v2 --render

I have not been able to get it converge. I have been running for >4000 episodes, but I have not seen any improvements. score bounces around -320 to -480.... and clearly the lander is not making progress.

score

I am using a newer version of Keras (2.2.4) - I had errors initially, but was able to resolve them using the comments from @zynk13 (https://github.com/germain-hug/Deep-RL-Keras/issues/2) All parameters (lr, network structure) are the same as the original code

Anyone able to achieve good results with the lunar lander?

Thanks