Goal directed - Githubissues

yosider / merlin

(Personal experiment) Unsupervised Predictive Memory in a Goal-Directed Agent https://arxiv.org/abs/1803.10760

24 stars 5 forks source link

My understanding is Merlin targets "goal-directed behaviours". For example in the videos the agent repeatedly finds ways to a specific goal.

However in the memory game, the cards are shuffled each time, which does require memory but there is no sense of static "goal".

My question: Is merlin certainly applicable to the memory game, even though there is no known static goal state? In the code I see memory accumulating across episodes, but I think that means past memories (from previous episodes) are not useful.

Please forgive my ignorance if I am mistaken, also I know this is a work in progress. Your sharing is very much appreciated.

yosider / merlin

Goal directed #2