kkspeed / chess

Chinese chess with bot
MIT License
1 stars 1 forks source link

Implement Actor Critic Agent #4

Open kkspeed opened 6 years ago

kkspeed commented 6 years ago

Implement and explore the effectiveness of actor critic agent.

thekingofkings commented 6 years ago

About AC

An effective learning method called Asynchronous Advantage Actor Critic (A3C) published by DeepMind. This algorithm beats the famous DQN by quite a margin.

https://medium.com/@henrymao/reinforcement-learning-using-asynchronous-advantage-actor-critic-704147f91686