qgallouedec / fake_repo_for_issue_form

0 stars 0 forks source link

[Question] My question title #2

Closed qgallouedec closed 1 year ago

qgallouedec commented 1 year ago

This report was last updated on Fri Jan 13 12:50:38 2023. To generate it, use this python script.

Total benchmark progress: ⬛⬛⬛⬛⬛⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 978/2690 (36.4%)

Link to openrlbechmark: https://wandb.ai/openrlbenchmark/sb3

To contribute

Install RL Baselines3 Zoo

pip install rl_zoo3 --upgrade

and run the following command

python -m rl_zoo3.train --algo <ALGO> --env <ENV> --eval-episodes 20 --n-eval-envs 5 --track --wandb-entity openrlbenchmark --wandb-project-name sb3

A2C

Use --algo a2c.

Environment Runs
Acrobot-v1 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
Ant-v3 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
AntBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
BipedalWalker-v3 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
BipedalWalkerHardcore-v3 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
CartPole-v1 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 12/10
HalfCheetah-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
HalfCheetahBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
Hopper-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
HopperBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
Humanoid-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
LunarLander-v2 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 20/10
LunarLanderContinuous-v2 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
MountainCar-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 11/10
MountainCarContinuous-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
Pendulum-v1 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 11/10
ReacherBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
Swimmer-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
Walker2DBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
Walker2d-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
BeamRiderNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
BreakoutNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
EnduroNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
PongNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
QbertNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
SeaquestNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
SpaceInvadersNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10

ARS

Use --algo ars.

Environment Runs
A1Jumping-v0 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
A1Walking-v0 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
Acrobot-v1 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 19/10
Ant-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
AntBulletEnv-v0 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
BipedalWalker-v3 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
BipedalWalkerHardcore-v3 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
CartPole-v1 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 12/10
HalfCheetah-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
HalfCheetahBulletEnv-v0 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
Hopper-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
HopperBulletEnv-v0 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
Humanoid-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
LunarLander-v2 ⬛⬛⬛⬛⬛⬛⬛⬜⬜⬜ 7/10
LunarLanderContinuous-v2 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
MountainCar-v0 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
MountainCarContinuous-v0 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
Pendulum-v1 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
ReacherBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
Swimmer-v3 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
Walker2DBulletEnv-v0 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
Walker2d-v3 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10

DDPG

Use --algo ddpg.

Environment Runs
Ant-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
AntBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 11/10
BipedalWalker-v3 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
BipedalWalkerHardcore-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
HalfCheetah-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
HalfCheetahBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 11/10
Hopper-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
HopperBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 11/10
Humanoid-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
HumanoidBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
InvertedDoublePendulumBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
InvertedPendulumSwingupBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 11/10
LunarLanderContinuous-v2 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
MountainCarContinuous-v0 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
Pendulum-v1 ⬛⬛⬛⬜⬜⬜⬜⬜⬜⬜ 3/10
ReacherBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
Swimmer-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
Walker2DBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
Walker2d-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10

DQN

Use --algo dqn.

Environment Runs
Acrobot-v1 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 11/10
CartPole-v1 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 11/10
LunarLander-v2 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 11/10
MountainCar-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
BeamRiderNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
BreakoutNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
EnduroNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
PongNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
QbertNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
SeaquestNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
SpaceInvadersNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10

PPO

Use --algo ppo.

Environment Runs
Acrobot-v1 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 21/10
Ant-v3 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
AntBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 21/10
BipedalWalker-v3 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 22/10
BipedalWalkerHardcore-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
CarRacing-v0 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
CartPole-v1 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 24/10
HalfCheetah-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
HalfCheetahBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 21/10
Hopper-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
HopperBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 22/10
Humanoid-v3 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
HumanoidBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 20/10
HumanoidStandup-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
InvertedDoublePendulum-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
InvertedDoublePendulumBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 20/10
InvertedPendulum-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
InvertedPendulumSwingupBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
LunarLander-v2 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 20/10
LunarLanderContinuous-v2 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 20/10
MiniGrid-DoorKey-5x5-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 20/10
MiniGrid-FourRooms-v0 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
MinitaurBulletDuckEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 20/10
MinitaurBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
MountainCar-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 20/10
MountainCarContinuous-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
Pendulum-v1 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 22/10
Reacher-v2 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
ReacherBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 20/10
Swimmer-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
Walker2DBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬜⬜⬜⬜ 6/10
Walker2d-v3 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
BeamRiderNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
BreakoutNoFrameskip-v4 ⬛⬛⬛⬛⬜⬜⬜⬜⬜⬜ 4/10
EnduroNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
PongNoFrameskip-v4 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
QbertNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
SeaquestNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
SpaceInvadersNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10

PPO_LSTM

Use --algo ppo_lstm.

Environment Runs
Acrobot-v1 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
Ant-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
AntBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
BipedalWalker-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
BipedalWalkerHardcore-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
CarRacing-v0 ⬛⬛⬛⬜⬜⬜⬜⬜⬜⬜ 3/10
CartPoleNoVel-v1 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
HalfCheetah-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
HalfCheetahBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
Hopper-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
HopperBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
Humanoid-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
HumanoidBulletEnv-v0 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
HumanoidStandup-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
InvertedDoublePendulum-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
InvertedDoublePendulumBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
InvertedPendulum-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
InvertedPendulumSwingupBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
LunarLanderContinuousNoVel-v2 ⬛⬛⬛⬛⬛⬜⬜⬜⬜⬜ 5/10
LunarLanderNoVel-v2 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
MinitaurBulletDuckEnv-v0 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
MinitaurBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
MountainCarContinuousNoVel-v0 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
MountainCarNoVel-v0 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
PendulumNoVel-v1 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
Reacher-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
ReacherBulletEnv-v0 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
Swimmer-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
Walker2DBulletEnv-v0 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
Walker2d-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
BeamRiderNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
BreakoutNoFrameskip-v4 ⬛⬛⬛⬛⬜⬜⬜⬜⬜⬜ 4/10
EnduroNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
PongNoFrameskip-v4 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
QbertNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
SeaquestNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
SpaceInvadersNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10

QRDQN

Use --algo qrdqn.

Environment Runs
Acrobot-v1 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
CartPole-v1 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
LunarLander-v2 ⬛⬛⬛⬜⬜⬜⬜⬜⬜⬜ 3/10
MountainCar-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬜ 9/10
BeamRiderNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
BreakoutNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
EnduroNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
PongNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
QbertNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
SeaquestNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
SpaceInvadersNoFrameskip-v4 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10

SAC

Use --algo sac.

Environment Runs
Ant-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
AntBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 11/10
BipedalWalker-v3 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
BipedalWalkerHardcore-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
CarRacing-v0 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
FetchReach-v1 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
HalfCheetah-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
HalfCheetahBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 11/10
Hopper-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
HopperBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 11/10
Humanoid-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
HumanoidBulletEnv-v0 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
InvertedDoublePendulumBulletEnv-v0 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
InvertedPendulumSwingupBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 11/10
LunarLanderContinuous-v2 ⬛⬛⬛⬜⬜⬜⬜⬜⬜⬜ 3/10
MinitaurBulletDuckEnv-v0 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
MinitaurBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 11/10
MountainCarContinuous-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
NeckEnvRelative-v2 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
Pendulum-v1 ⬛⬛⬛⬛⬜⬜⬜⬜⬜⬜ 4/10
ReacherBulletEnv-v0 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
Swimmer-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
Walker2DBulletEnv-v0 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
Walker2d-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
donkey-generated-track-v0 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10

TD3

Use --algo td3.

Environment Runs
Ant-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
AntBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 11/10
BipedalWalker-v3 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
BipedalWalkerHardcore-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
HalfCheetah-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
HalfCheetahBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 11/10
Hopper-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
HopperBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 11/10
Humanoid-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
HumanoidBulletEnv-v0 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
InvertedDoublePendulumBulletEnv-v0 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
InvertedPendulumSwingupBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 11/10
LunarLanderContinuous-v2 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
MinitaurBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 11/10
MountainCarContinuous-v0 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
Pendulum-v1 ⬛⬛⬛⬛⬜⬜⬜⬜⬜⬜ 4/10
ReacherBulletEnv-v0 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
Swimmer-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
Walker2DBulletEnv-v0 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
Walker2d-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10

TQC

Use --algo tqc.

Environment Runs
A1Walking-v0 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
Ant-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
AntBulletEnv-v0 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
BipedalWalker-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
BipedalWalkerHardcore-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
CarRacing-v0 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
FetchPickAndPlace-v1 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
FetchPush-v1 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
FetchReach-v1 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
FetchSlide-v1 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
HalfCheetah-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
HalfCheetahBulletEnv-v0 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
Hopper-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
HopperBulletEnv-v0 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
Humanoid-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
HumanoidBulletEnv-v0 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
InvertedDoublePendulumBulletEnv-v0 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
InvertedPendulumSwingupBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
LunarLanderContinuous-v2 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
MinitaurBulletEnv-v0 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
MountainCarContinuous-v0 ⬛⬛⬛⬛⬛⬛⬜⬜⬜⬜ 6/10
PandaPickAndPlace-v1 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
PandaPush-v1 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 20/10
PandaReach-v1 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 22/10
PandaSlide-v1 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
PandaStack-v1 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
Pendulum-v1 ⬛⬛⬛⬜⬜⬜⬜⬜⬜⬜ 3/10
ReacherBulletEnv-v0 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
RocketLander-v0 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
Swimmer-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
Walker2DBulletEnv-v0 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
Walker2d-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
parking-v0 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10

TRPO

Use --algo trpo.

Environment Runs
Acrobot-v1 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
Ant-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
AntBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
BipedalWalker-v3 ⬛⬛⬜⬜⬜⬜⬜⬜⬜⬜ 2/10
BipedalWalkerHardcore-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
CartPole-v1 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
HalfCheetah-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
HalfCheetahBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
Hopper-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
HopperBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
Humanoid-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
HumanoidBulletEnv-v0 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
InvertedDoublePendulumBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
InvertedPendulumSwingupBulletEnv-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 10/10
LunarLander-v2 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 13/10
LunarLanderContinuous-v2 ⬛⬛⬛⬜⬜⬜⬜⬜⬜⬜ 3/10
MinitaurBulletDuckEnv-v0 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
MinitaurBulletEnv-v0 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
MountainCar-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 13/10
MountainCarContinuous-v0 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 12/10
Pendulum-v1 ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 11/10
ReacherBulletEnv-v0 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
Swimmer-v3 ⬛⬜⬜⬜⬜⬜⬜⬜⬜⬜ 1/10
Walker2DBulletEnv-v0 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10
Walker2d-v3 ⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜ 0/10