DylanCope / Multi-Agent-RL-with-TF

Training intrinsically motivated, independent Q-learners to play Tic-Tac-Toe
https://dylancope.github.io/Multiagent-RL-with-TFAgents/
11 stars 5 forks source link