QiXuanWang / LearningFromTheBest

This project is to list the best books, courses, tutorial, methods on learning certain knowledge
8 stars 1 forks source link

Prioritized Sequence Experience Replay By: Marc Brittain * 1 Josh Bertram * 1 Xuxi Yang * 1 Peng Wei #29

Open QiXuanWang opened 4 years ago

QiXuanWang commented 4 years ago

Link: Semanticscholar

Published in first in 2019 and polished in 2020.Feb This is a extension to Prioritized Experience Replay method. The question, when to use experience replay? Is it only applicable to off-policy learning? HER is to solve the sparse reward problem. So probably they are of different usage? I don't know.

Contribution: Use below equations to determine the probability of sampling transition i. image image Decay strategy image