TMats / survey

Summary of Paper Survey
https://tmats.github.io/survey/
16 stars 2 forks source link

Deep Successor Reinforcement Learning #72

Open TMats opened 6 years ago

TMats commented 6 years ago

https://arxiv.org/abs/1606.02396

TMats commented 6 years ago

RL algorithms fall into two main classes: (1) model-free algorithms that learn cached value functions directly from sample trajectories, and (2) model-based algorithms that estimate transition and reward functions, from which values can be computed using tree-search or dynamic programming. However, there is a third class, based on the successor representation (SR), that factors the value function into a predictive representation and a reward function.

TMats commented 6 years ago
2017-12-12 17 19 39