Broken Prioritized Experience Replay

Kautenja / playing-mario-with-deep-reinforcement-learning

An implementation of (Double/Dueling) Deep-Q Learning to play Super Mario Bros.

MIT License

68 stars 12 forks source link

Broken Prioritized Experience Replay #33

Open Kautenja opened 6 years ago

Kautenja commented 6 years ago

results on Pong seem to indicate that the experience replay functionality is not working correctly. Performance is terrible, and the agent is far worse than its vanilla alternative. The paper introducing this technique will need reviewed to locate the source of the learning error. A code review is necessary to understand performance limitations and improve the runtime.

Kautenja commented 6 years ago

https://github.com/openai/baselines/blob/master/baselines/deepq/replay_buffer.py