Open slavakurilyak opened 6 years ago
In order to understand the potential of reinforcement learning, it is important to look at the AlphaGo Zero case study.
AlphaGo Zero is the strongest Go player in the world. It outperformed all previous versions of AlphaGo. It defeated the version of AlphaGo that won against the world champion Lee Sedol by 100 games to 0.
What was the difference between AlphaGo and AlphaGo Zero? AlphaGo was trained by supervised learning from human expert moves, and by reinforcement learning from self-play. AlphaGo Zero was trained solely on reinforcement learning, without human data.
If you want to learn more about AlphaGo Zero, watch this video (2 min).
DeepMind releases trfl, a reinforcement learning framework built on top of tensorflow. https://github.com/deepmind/trfl/blob/master/docs/index.md
Goal
As a developer, I want to develop and compare Reinforcement Learning (RL) algorithms, so that I can teach AI agents cryptocurrency trading.
Consider
Inspiration
-- Xiang Gao, 2018 (Research Paper, Source Code)