cyoon1729 / deep-Q-networks

Implementations of algorithms from the Q-learning family. Implementations inlcude: DQN, DDQN, Dueling DQN, PER+DQN, Noisy DQN, C51
258 stars 77 forks source link

the script does not converge to a solution #3

Open laz8 opened 4 years ago

laz8 commented 4 years ago

https://github.com/cyoon1729/deep-Q-networks/blob/cb3f1551bc927fedf7166d7b0b3834aaff07d32e/test_gym/dueling_dqn.py#L13

Episode 399: 12.0 Episode 400: 10.0 Episode 417: 11.0 ... Episode 997: 19.0 Episode 998: 11.0 Episode 999: 10.0

And why are you not using a target network?

seolhokim commented 4 years ago

Hi, I found similar issue in perdqn that done sign is not used. so i solved this issue in the same way.