devsisters / DQN-tensorflow

Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning
MIT License
2.48k stars 764 forks source link

Possible bug of using numpy randint #42

Open bigvzhang opened 6 years ago

bigvzhang commented 6 years ago

Code of line 62 in replay_memory.py: index = random.randint(self.history_length, self.count - 1) should change to index = random.randint(self.history_length, self.count) Correct ?