PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
MIT License
CartPole-v0 model can't be loaded by enjoy.py #148
Baselines Version: 0.1.5
python3 main.py --env-name "CartPole-v0" --num-frames 100000
python3 enjoy.py --load-dir trained_models/a2c --env-name "CartPole-v0"
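
A quick way to narrow this down is to check whether the training run actually wrote a checkpoint for this environment into the directory passed to --load-dir. The sketch below is only a diagnostic helper, not code from the repository: `find_checkpoints` is a hypothetical name, and the assumption that checkpoints are named after the environment with a `.pt` suffix has not been confirmed in this report.

```python
import os


def find_checkpoints(load_dir, env_name, suffix=".pt"):
    """Return sorted paths in load_dir whose file name starts with env_name.

    Assumption (not confirmed by the issue): checkpoints are saved as
    "<env_name><suffix>" or with that prefix.
    """
    if not os.path.isdir(load_dir):
        # Directory is missing, so nothing could have been saved there.
        return []
    return sorted(
        os.path.join(load_dir, name)
        for name in os.listdir(load_dir)
        if name.startswith(env_name) and name.endswith(suffix)
    )


if __name__ == "__main__":
    # Same directory and environment name as in the commands above.
    matches = find_checkpoints("trained_models/a2c", "CartPole-v0")
    print(matches or "no checkpoint found for CartPole-v0 in trained_models/a2c")
```

If the list is empty, the problem is likely on the saving side (or a path mismatch) rather than in how enjoy.py loads the model.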