sunwuzhou03 / reinforcement-learning-lab

Implementations of several single-agent reinforcement learning (RL) algorithms, written as a way to learn how the algorithms work.

Algorithms

Implemented algorithms

dqn, ddqn, drqn

reinforce, reinforce with baseline

ac, ac with target, a2c, a2c with target, a3c, sac

ppo, ippo, multidiscrete action ppo

To-be-implemented algorithms

dueling dqn

trpo

ddpg

Benchmark

- custom env
  - Snake-0
  - Walker (discrete version of BipedalWalker-v3)
- CartPole-v0
- Pendulum-v1
- BipedalWalker-v3
- BipedalWalkerHardcore-v3
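
The Walker entry in the list above is described as a discrete version of BipedalWalker-v3, whose actions are four joint torques in [-1, 1]. The README does not say how the discretization is done, so the following is only a minimal sketch of one possible mapping, not the repository's implementation. The three torque levels and the names `TORQUE_LEVELS`, `ACTION_TABLE`, and `discrete_to_continuous` are assumptions made for illustration.

```python
from itertools import product

# Assumption: three torque levels per joint. The repository's own Walker
# environment may use a different discretization.
TORQUE_LEVELS = (-1.0, 0.0, 1.0)
NUM_JOINTS = 4  # BipedalWalker-v3 actions are 4 joint torques in [-1, 1]

# Enumerate every combination of per-joint torque levels: 3**4 = 81 actions.
ACTION_TABLE = list(product(TORQUE_LEVELS, repeat=NUM_JOINTS))


def discrete_to_continuous(index):
    """Map a discrete action index to a list of 4 joint torques."""
    if not 0 <= index < len(ACTION_TABLE):
        raise ValueError(f"action index out of range: {index}")
    return list(ACTION_TABLE[index])
```

With a table like this, a discrete-action agent such as dqn or ppo could pick an index, and a thin wrapper around the continuous environment would pass the looked-up torque vector to its `step` call. A per-joint (multidiscrete) action space would be an alternative that avoids enumerating all combinations.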