Use correct states for `incremental_update()`

RasmusBrostroem / ConnectFourRL

0 stars 0 forks source link

Use correct states for `incremental_update()` #91

Closed jbirkesteen closed 1 year ago

jbirkesteen commented 1 year ago

We've made a mistake in TDAgent.incremental_update(). Currently, the state we use as S_t+1 is the state just after A_t. However, S_t+1 should be the state after opponent's move, i.e. the state in which the agent has to decide its next action (A_t+1).