keon / policy-gradient

Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras

MIT License

159 stars 43 forks source link

deep-reinforcement-learning keras policy-gradient reinforcement-learning

Policy Gradient

Minimal implementation of Stochastic Policy Gradient Algorithm in Keras

This PG agent seems to get more frequent wins after about 8000 episodes. Below is the score graph.

score