Closed by 8bitmp3 4 years ago
@seungjaeryanlee is working on this as part of a Google Summer of Code project. See his blog for progress: https://www.endtoend.ai/tags/gsoc/
Is there an update on this? Will it be released soon, or is there an early tested version ready for use?
A gentle request for a TF-Agents implementation of a modified PPO with an exploration bonus - for testing on Montezuma's Revenge.
Paper: Exploration by Random Network Distillation, Burda et al. (OpenAI, University of Edinburgh).
Code (TF 1.x): https://github.com/openai/random-network-distillation/tree/master/policies
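For anyone reading the request without the paper at hand, here is a rough sketch of the exploration bonus being asked for. It is not the TF-Agents API and not the OpenAI code linked above. It only illustrates the core idea from the paper: a predictor network is trained to match a fixed, randomly initialized target network, and the prediction error on an observation serves as the intrinsic reward. The single-layer networks, dimensions, learning rate, and all function names here are assumptions made for illustration; the paper uses convolutional networks on Atari frames and feeds the bonus into PPO.

```python
import math
import random


def make_weights(in_dim, out_dim, rng, scale=1.0):
    """Random weight matrix (out_dim rows, in_dim columns). Hypothetical helper."""
    return [[rng.gauss(0.0, scale) for _ in range(in_dim)] for _ in range(out_dim)]


def target_features(weights, obs):
    """Fixed random target network: one linear layer followed by tanh. Never trained."""
    return [math.tanh(sum(w * x for w, x in zip(row, obs))) for row in weights]


def predictor_features(weights, obs):
    """Trainable predictor: a single linear layer (an assumption, for brevity)."""
    return [sum(w * x for w, x in zip(row, obs)) for row in weights]


def intrinsic_reward(target_w, pred_w, obs):
    """Exploration bonus: mean squared error between predictor and target outputs."""
    t = target_features(target_w, obs)
    p = predictor_features(pred_w, obs)
    return sum((pi - ti) ** 2 for pi, ti in zip(p, t)) / len(t)


def train_predictor(target_w, pred_w, obs, lr=0.1):
    """One SGD step on the predictor, minimizing the same mean squared error."""
    t = target_features(target_w, obs)
    p = predictor_features(pred_w, obs)
    for row, pi, ti in zip(pred_w, p, t):
        grad_scale = 2.0 * (pi - ti) / len(t)  # d(loss)/d(output) for this unit
        for j, x in enumerate(obs):
            row[j] -= lr * grad_scale * x


rng = random.Random(0)
target_w = make_weights(4, 8, rng)            # frozen after initialization
pred_w = make_weights(4, 8, rng, scale=0.1)   # updated during training

seen_obs = [1.0, 0.0, 0.5, -0.5]              # toy observation visited repeatedly

bonus_before = intrinsic_reward(target_w, pred_w, seen_obs)
for _ in range(200):
    train_predictor(target_w, pred_w, seen_obs)
bonus_after = intrinsic_reward(target_w, pred_w, seen_obs)

print("bonus before training:", bonus_before)
print("bonus after training: ", bonus_after)
```

The bonus for the repeatedly visited observation shrinks as the predictor learns it, which is what pushes the agent toward states it has not seen yet. In a PPO setup this bonus would be combined with the environment reward before computing advantages; how that would be wired into TF-Agents is left open here.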