quangr/jax-rl
JAX version of the PPO algorithm in the MuJoCo environment, achieving SOTA (Tianshou)
0 stars
0 forks
issues
The impact of random shuffle
#4
quangr
opened
1 year ago
0
Add recurrent neural network
#3
quangr
opened
1 year ago
2
reproduce ppo benchmark
#2
quangr
closed
1 year ago
0
Add env wrapper
#1
quangr
opened
1 year ago
0