sweetice / Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
MIT License
3.75k stars 837 forks source link

a bug in DQN.py #15

Open karlhjm opened 4 years ago

karlhjm commented 4 years ago

https://github.com/sweetice/Deep-reinforcement-learning-with-pytorch/blob/2ad8b9f4d20c1f680f30f0627300ee65167291cd/Char01%20DQN/DQN.py#L134

in my opinion,it is variable 'ep_reward' that should be stored here instead of 'reward'