PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Firstly, thank you for the excellent repository.
Are there any pybullet expert trajectories available? And if not, how would I go about creating some?
Cheers,