RasmusBrostroem / ConnectFourRL

Use correct states for `incremental_update()` #91

Closed jbirkesteen closed 1 year ago

jbirkesteen commented 1 year ago

We've made a mistake in `TDAgent.incremental_update()`. Currently, the state we use as S_{t+1} is the state immediately after the agent's action A_t. However, S_{t+1} should be the state after the opponent's move, i.e. the state in which the agent has to decide its next action A_{t+1}.
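To illustrate the intended behaviour, here is a minimal tabular TD(0) sketch. It is not the repository's implementation: `td0_update`, the `values` table, the state labels, and the `alpha`/`gamma` defaults are all hypothetical names and values chosen for the example. The point is only that the bootstrap target uses the state in which the agent moves next, not the state right after its own move.

```python
from collections import defaultdict


def td0_update(values, state, reward, next_state,
               alpha=0.1, gamma=1.0, terminal=False):
    """One TD(0) step: V(S_t) += alpha * (R + gamma * V(S_{t+1}) - V(S_t)).

    `next_state` must be the state in which the agent acts next,
    i.e. the board after the opponent has replied (hypothetical helper).
    """
    # No bootstrapping from a terminal state: the target is just the reward.
    target = reward if terminal else reward + gamma * values[next_state]
    values[state] += alpha * (target - values[state])
    return values[state]


values = defaultdict(float)          # hypothetical value table, V = 0 by default

s_t = "agent_to_move_0"              # S_t: agent chooses A_t here
afterstate = "after_agent_move"      # board right after A_t (wrong choice for S_{t+1})
s_t1 = "agent_to_move_1"             # S_{t+1}: board after the opponent's reply

values[s_t1] = 1.0                   # pretend the next decision state looks good

# Correct: bootstrap from the state after the opponent's move.
td0_update(values, s_t, reward=0.0, next_state=s_t1, alpha=0.5)
```

In a game loop this means the update for A_t has to be deferred until the opponent has moved (or the game has ended), since S_{t+1} is not known before then.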