gameofdimension / policy-gradient-pong

TensorFlow implementation of Andrej Karpathy's blog post on reinforcement learning, "Deep Reinforcement Learning: Pong from Pixels": http://karpathy.github.io/2016/05/31/rl/