Deep Reinforcement Learning implemented in PyTorch.
This project tries to mimic the structure of the rllab code and occasionally borrows code from rllab and inverse_rl.
Implemented Algorithms
Online
DDPG (Deep Deterministic Policy Gradient)
NAF (Normalized Advantage Function)
Soft Q-Learning (Deep Energy-Based Policy)
Batch
TRPO (Trust Region Policy Optimization)
Inverse Reinforcement Learning & Imitation Learning
GAIL (Generative Adversarial Imitation Learning)
AIRL (Adversarial Inverse Reinforcement Learning)