AkiraTOSEI / ML_papers

ML_paper_summary(in Japanese)
5 stars 1 forks source link

NEVER GIVE UP: LEARNING DIRECTED EXPLORATION STRATEGIES #96

Open AkiraTOSEI opened 3 years ago

AkiraTOSEI commented 3 years ago

TL;DR

Propose an NGU that uses integrated search rewards for multiple episodes and single episodes each, and It was. The former uses RND, while the latter uses embedded vectors and kNN to find new states. It scored high in pitfall, Montezuma's Revenge. image

Why it matters:

Paper URL

https://arxiv.org/abs/2002.06038

Submission Dates(yyyy/mm/dd)

Authors and institutions

Methods

Results

Comments