junxnone / tio

Log

Other

10 stars 5 forks source link

RL - PPO #830

Open junxnone opened 4 years ago

junxnone commented 4 years ago

Reference

07/2017 Proximal policy optimization algorithms

Brief

基于策略梯度(PG，Policy Gradient)

junxnone commented 4 years ago

junxnone/aiwiki#290