Jimenius / InternRL

0 stars 0 forks source link

Add Double DQN to DQN implementation #20

Open Jimenius opened 5 years ago

Jimenius commented 5 years ago

DQN: Y = r + gamma max(Q'(s')) Double DQN: Y = r + gamma Q'(s', argmax(Q(s')))

Jimenius commented 5 years ago

Dueling Architecture Q = V + (A - mean(A))