Get all required target functions working

leesharkey commented 2 years ago

- Value (high and low)
- Value (large increases or decreases)
- Actions
- hx neurons
- hx directions

leesharkey commented 2 years ago

Demoting the urgency of this because I think it'll likely take too long to fix.

Basically, the optimization is extremely unstable. It more or less works for most target functions, but not for IC directions.

My main hypothesis as to why:

There are multiple discrete variables in the latents (the categorical variables in the RSSM, and also the action space). This means that slight changes in the bottleneck vector can lead to very different samples.
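A toy illustration of that hypothesis, not code from this repo: `toy_logits`, `sample_category`, and the noise values are made up for the sketch. With the sampling noise held fixed, a tiny shift in a scalar stand-in for the bottleneck flips the categorical sample to a different class.

```python
def toy_logits(z):
    # Hypothetical stand-in for a decoder: maps a scalar "bottleneck"
    # value z to logits over two classes. Not the project's model.
    return [z, -z]

def sample_category(logits, noise):
    # Gumbel-max-style sampling: add noise to each logit, take the argmax.
    # The noise is passed in so it can be held fixed between calls.
    perturbed = [l + n for l, n in zip(logits, noise)]
    return max(range(len(perturbed)), key=lambda i: perturbed[i])

# Fixed, hand-picked noise (an assumption for the demo, not real Gumbel draws).
noise = [0.3, -0.1]

before = sample_category(toy_logits(-0.199), noise)  # class 0
after = sample_category(toy_logits(-0.201), noise)   # class 1
print(before, after)
```

The input moves by 0.002 but the sample jumps from one class to the other, so anything downstream of the sample sees a discontinuous change.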

Potential future solution:

Use Gaussian variables instead of categoricals in the RSSM
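For comparison, a minimal sketch of why Gaussian latents might behave better, assuming a reparameterized sample of the form `mu + sigma * eps`. The names `toy_mean` and `gaussian_sample` and all constants are hypothetical, and this only illustrates continuity; it is not evidence that the swap would stabilize the optimization, and it does not touch the discrete action space.

```python
def toy_mean(z):
    # Hypothetical stand-in for a decoder that outputs a Gaussian mean
    # from a scalar "bottleneck" value z. Not the project's model.
    return z

def gaussian_sample(mu, sigma, eps):
    # Reparameterized Gaussian sample: the noise eps is drawn separately,
    # so the sample is a smooth function of mu and sigma.
    return mu + sigma * eps

sigma = 0.5  # fixed standard deviation (assumed for the demo)
eps = 0.8    # fixed noise draw, held constant between calls

s_before = gaussian_sample(toy_mean(-0.199), sigma, eps)
s_after = gaussian_sample(toy_mean(-0.201), sigma, eps)
print(s_before - s_after)  # about 0.002, the same size as the input change
```

Here the same 0.002 shift in the input moves the sample by roughly 0.002, rather than switching it to a different class.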

leesharkey commented 2 years ago

We're not going to do target functions. Dataset examples will suffice.

interpreting-rl-behavior / interpreting-rl-behavior.github.io

Get all required target functions working #61