interpreting-rl-behavior / interpreting-rl-behavior.github.io

Code for the site https://interpreting-rl-behavior.github.io/
Creative Commons Attribution 4.0 International
0 stars 0 forks source link

Confirm that actions, agent hx, and observations are aligned in the gen model #39

Closed leesharkey closed 2 years ago

leesharkey commented 2 years ago

Compare it with the order in which they are recorded. I'm especially unsure about hx alignment.

leesharkey commented 2 years ago

Also check reward and done/terminal alignment with everything else too

leesharkey commented 2 years ago

Done primarily in f7843f216e4cf713c8430122cddca1aded4f147b