nikhilbarhate99 / TD3-PyTorch-BipedalWalker-v2

Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment
MIT License
101 stars 30 forks source link
bipedalwalker ddpg deep-reinforcement-learning lunar-lander openai-gym openai-gym-environments pytorch pytorch-implmention reinforcement-learning td3

TD3-BipedalWalker-v2-PyTorch

PyTorch implementation of Twin Delayed DDPG (TD3) tested on the following environments:

Usage

Dependencies

Trained and tested on:

Python 3.6
PyTorch 0.4.1
NumPy 1.15.3
gym 0.10.8
Roboschool 1.0.46
Pillow 5.3.0

Results

BipedalWalker-v2 (800 episodes) LunarLanderContinuous-v2 (1500 episodes)
RoboschoolWalker2d-v1 (lr=0.002, 1400 episodes) HalfCheetah-v1 (lr=0.002, 1400 episodes)

*The results are not consistent for BipedalWalker-v2 env

References