alirezakazemipour / DDPG-HER

Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.
88 stars 18 forks source link
ddpg fetchpickandplace her hindsight-experience-replay

DDPG + HER

Implementation of the Deep Deterministic Policy Gradient with Hindsight Experience Replay Extension on the MuJoCo's robotic FetchPickAndPlace environment.

Visit vanilla_DDPG branch for the implementation without the HER extention.

Dependencies

Installation

pip3 install -r requirements.txt

Usage

mpirun -np $(nproc) python3 -u main.py

Demo

Result

Reference

  1. Continuous control with deep reinforcement learning, Lillicrap et al., 2015
  2. Hindsight Experience Replay, Andrychowicz et al., 2017
  3. Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research, Plappert et al., 2018

    Acknowledgement

    All the credit goes to @TianhongDai for his simplified implementation of the original OpenAI's code.