deepchem / deepchem

Democratizing Deep-Learning for Drug Discovery, Quantum Chemistry, Materials Science and Biology
https://deepchem.io/
MIT License
5.54k stars 1.69k forks source link

Proximal Policy Optimization #685

Closed rbharath closed 7 years ago

rbharath commented 7 years ago

https://blog.openai.com/openai-baselines-ppo/

OpenAI says PPO has become their default RL algorithm. Should we get a PPO implementation going in TensorGraph?

CC @peastman

rbharath commented 7 years ago

Opened this from my phone twice by accident. Duplicate of #684