mrahtz / tensorflow-rl-pong

Pong AI trained using policy gradient-based reinforcement learning
51 stars 21 forks source link