x35f / unstable_baselines

Re-implementations of SOTA RL algorithms.
127 stars 12 forks source link

ADD: TRPO #40

Closed x35f closed 2 years ago