PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Are there plans to add off-policy algorithms to this repository any time soon? I am thinking of ACER/SAC (which should be compatible with both continuous-action MuJoCo and discrete-action ALE tasks) and DDPG/TD3 for continuous control.
It would be useful to have all these algorithms implemented in the same repo, since it is being used as the standard codebase for a lot of papers these days.