CUN-bjy / gym-td3-keras

Keras Implementation of TD3(Twin Delayed DDPG) with PER(Prioritized Experience Replay) option on OpenAI gym framework
GNU General Public License v3.0
10 stars 4 forks source link
gym keras openai-gym per reinforcement-learning rl roboschool td3 tensorflow2

gym-td3-keras

Reference Code : gym-ddpg-keras(DDPG)

Keras Implementation of TD3(Twin Delayed Deep Deterministic Policy Gradient) with PER(Prioritized Experience Replay) option on OpenAI gym framework

STATUS : IN PROGRESS

This branch is just for debugging, change the branch to main.


To do

Test on Simulation


Experiment Details from paper

Network Model & Hyperparameter

Differences from DDPG

Exploration

Evaluation


Easy Installation

  1. Make an independent environment using virtualenv
# install virtualenv module
sudo apt-get install python3-pip
sudo pip3 install virtualenv

# create a virtual environment named venv
virtualenv venv 

# activate the environment
source venv/bin/activate 

​ To escape the environment, deactivate

  1. Install the requirements
pip install -r requirements.txt
  1. Run the training node
#trainnig
python train.py


Reference

[1] Addressing Function Approximation Error in Actor-Critic Methods

@misc{fujimoto2018addressing,
      title={Addressing Function Approximation Error in Actor-Critic Methods}, 
      author={Scott Fujimoto and Herke van Hoof and David Meger},
      year={2018},
      eprint={1802.09477},
      archivePrefix={arXiv},
      primaryClass={cs.AI}
}

REVIEW | PAPER

[2] CUN-bjy/gym-ddpg-keras

[3] sfujim/TD3

[4] quantumiracle/SOTA-RL-Algorithms