Generate data that we can use to show that the environment simulation functions independently of the agent

interpreting-rl-behavior / interpreting-rl-behavior.github.io

Code for the site https://interpreting-rl-behavior.github.io/

Creative Commons Attribution 4.0 International

This is actually very important and is something I'm not totally confident about.

If we can't provide a convincing demonstration that this works (that the environment simulation functions independently of the agent), then we're back to training the generative model in such a way that it does (e.g. by using multiple agents, some of which are trained and some of which are not).

The quality of the generations in the demos for the AISC presentation was okay but not ideal. We haven't yet checked it for this generative model. Assigning this to @danbraunai. IIRC you did this for the presentation too.

Once the demos are produced, put them in the article.

Generate data that we can use to show that the environment simulation functions independently of the agent #25