Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin Riedmiller arXiv (2013), Nature (2015)
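Uses a CNN to learn directly from game screens and outperforms humans on three Atari games, a standard reinforcement learning benchmark. Training is stabilized through techniques such as Experience Replay, reward clipping, frame skipping, and a Fixed Target Q-Network.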
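As a rough illustration of three of the stabilization techniques named above (Experience Replay, reward clipping, and a fixed target network), here is a minimal Python sketch. It is not the authors' implementation: the names `ReplayBuffer`, `clip_reward`, `td_target`, and `target_q`, the clipping range [-1, 1], and the default `gamma=0.99` are all assumptions chosen for illustration.

```python
import random
from collections import deque


def clip_reward(reward):
    """Clamp a raw reward into [-1, 1] (assumed clipping range)."""
    return max(-1.0, min(1.0, float(reward)))


class ReplayBuffer:
    """Fixed-size store of past transitions, sampled uniformly at random."""

    def __init__(self, capacity):
        # The oldest transitions are dropped once capacity is reached.
        self.buffer = deque(maxlen=capacity)

    def push(self, state, action, reward, next_state, done):
        # Rewards are clipped on the way in (a design choice of this sketch).
        self.buffer.append((state, action, clip_reward(reward), next_state, done))

    def sample(self, batch_size):
        # Uniform sampling breaks the correlation between consecutive frames.
        return random.sample(list(self.buffer), batch_size)

    def __len__(self):
        return len(self.buffer)


def td_target(reward, next_state, done, target_q, gamma=0.99):
    """Compute the bootstrapped target r + gamma * max_a Q_target(s', a).

    target_q is a hypothetical callable returning one Q-value per action;
    it stands in for a periodically synced copy of the online network.
    """
    if done:
        return reward
    return reward + gamma * max(target_q(next_state))
```

In a training loop, the online network would be regressed toward `td_target` on minibatches drawn from `ReplayBuffer.sample`, while `target_q` is refreshed from the online network only every fixed number of steps.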