devsisters / DQN-tensorflow

Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning
MIT License
2.49k stars 762 forks source link

Possible bug of using numpy randint #42

Open bigvzhang opened 7 years ago

bigvzhang commented 7 years ago

Code of line 62 in replay_memory.py: index = random.randint(self.history_length, self.count - 1) should change to index = random.randint(self.history_length, self.count) Correct ?