Proximal Policy Optimization

deepchem / deepchem

Democratizing Deep-Learning for Drug Discovery, Quantum Chemistry, Materials Science and Biology

https://deepchem.io/

MIT License

Closed by rbharath 7 years ago

rbharath commented 7 years ago

OpenAI says PPO has become their default RL algorithm. Should we get a PPO implementation going in TensorGraph?

CC @peastman

rbharath commented 7 years ago

Opened this from my phone twice by accident. Duplicate of #684