Closed kuto5046 closed 4 years ago
Fortunato, Meire, Azar, Mohammad Gheshlaghi, Piot, Bilal, Menick, Jacob, Osband, Ian, Graves, Alex, Mnih, Vlad, Munos, Remi, Hassabis, Demis http://arxiv.org/abs/1706.10295
深層強化学習のNNの重みにパラメトリックノイズを加えたNoisyNetを提案。ノイズのパラメータも学習することで探索効率を向上。従来の探索手法(entropy-reward,ε-greedy)をNoiseNetに置き換えることでDQN,A3Cの性能が向上。
Fortunato, Meire, Azar, Mohammad Gheshlaghi, Piot, Bilal, Menick, Jacob, Osband, Ian, Graves, Alex, Mnih, Vlad, Munos, Remi, Hassabis, Demis http://arxiv.org/abs/1706.10295