Open kkspeed opened 6 years ago
Implement and explore the effectiveness of actor critic agent.
An effective learning method called Asynchronous Advantage Actor Critic (A3C) published by DeepMind. This algorithm beats the famous DQN by quite a margin.
https://medium.com/@henrymao/reinforcement-learning-using-asynchronous-advantage-actor-critic-704147f91686
Implement and explore the effectiveness of actor critic agent.