proximal-policy-optimization Search Results - Githubissues

173 results for proximal-policy-optimization


deepchem/deepchem #685

Proximal Policy Optimization

https://blog.openai.com/openai-baselines-ppo/ OpenAI says PPO has become their default RL algorithm. Should we get a PPO implementation going in TensorGraph? CC @peastman

rbharath updated 7 years ago
1
pytorch/pytorch #2598

Upgrading from 0.1.9 to 0.2.0 silently breaks policy gradien…

I was implementing Proximal Policy Optimization when I noticed that my PyTorch version was outdated, so I updated. To my surprise, the code I was running, which worked fine in 0.1.9, was completely brok…

hyparxis updated 7 years ago
2
deeplearninc/relaax #12

Distributed PPO with L-BFGS algorithm

TODO: Distributed version of PPO - Proximal Policy Optimization (i.e., TRPO, but using a penalty instead of a constraint on KL divergence), where each subproblem is solved with L-BFGS

StanislavVolodarskiy updated 7 years ago
1
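
The relaax #12 entry above describes PPO as TRPO with a penalty in place of a constraint on the KL divergence. As a rough illustration of that idea only, the sketch below evaluates a KL-penalized surrogate objective, mean(ratio * advantage - beta * KL(old || new)), for discrete action distributions. The function names, the list-based inputs, and the fixed `beta` are assumptions made for this example and are not taken from relaax or any other project listed here; the distributed setup and the L-BFGS solver mentioned in the issue are left out.

```python
import math


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions given as lists of probabilities."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


def kl_penalized_surrogate(old_probs, new_probs, actions, advantages, beta):
    """Hypothetical helper: mean of ratio * advantage - beta * KL(old || new).

    old_probs / new_probs: one action distribution per sample, under the old
    and the new policy. actions: index of the action taken in each sample.
    advantages: advantage estimate per sample. beta: penalty coefficient
    (assumed fixed here; adaptive schemes are not shown).
    """
    total = 0.0
    for old, new, action, adv in zip(old_probs, new_probs, actions, advantages):
        ratio = new[action] / old[action]  # probability ratio for the taken action
        total += ratio * adv - beta * kl_divergence(old, new)
    return total / len(actions)


old = [[0.5, 0.5], [0.25, 0.75]]
shifted = [[0.6, 0.4], [0.2, 0.8]]
actions = [0, 1]
advantages = [1.0, -0.5]

# With an unchanged policy the ratio is 1 and the KL term is 0,
# so the objective reduces to the mean advantage.
unchanged = kl_penalized_surrogate(old, old, actions, advantages, beta=1.0)

# A larger beta penalizes the same policy shift more heavily.
weak_penalty = kl_penalized_surrogate(old, shifted, actions, advantages, beta=0.1)
strong_penalty = kl_penalized_surrogate(old, shifted, actions, advantages, beta=10.0)
```

In practice the objective would be maximized over the new policy's parameters by an optimizer, which is where the issue proposes L-BFGS for each subproblem.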
