DDPG + HER

Implementation of the Deep Deterministic Policy Gradient with Hindsight Experience Replay Extension on the MuJoCo's robotic FetchPickAndPlace environment.

Visit vanilla_DDPG branch for the implementation without the HER extention.

Dependencies

gym == 0.17.2
matplotlib == 3.1.2
mpi4py == 3.0.3
mujoco-py == 2.0.2.13
numpy == 1.19.1
opencv_contrib_python == 3.4.0.12
psutil == 5.4.2
torch == 1.4.0

Installation

pip3 install -r requirements.txt

Usage

mpirun -np $(nproc) python3 -u main.py

Demo

Result

Reference

Continuous control with deep reinforcement learning, Lillicrap et al., 2015
Hindsight Experience Replay, Andrychowicz et al., 2017
Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research, Plappert et al., 2018
Acknowledgement

All the credit goes to @TianhongDai for his simplified implementation of the original OpenAI's code.

alirezakazemipour / DDPG-HER

readme

DDPG + HER

Dependencies

Installation

Usage

Demo

Result

Reference

Acknowledgement