Open Jimenius opened 5 years ago
DQN: Y = r + gamma max(Q'(s')) Double DQN: Y = r + gamma Q'(s', argmax(Q(s')))
Dueling Architecture Q = V + (A - mean(A))
DQN: Y = r + gamma max(Q'(s')) Double DQN: Y = r + gamma Q'(s', argmax(Q(s')))