ikostrikov / pytorch-ddpg-naf

Implementation of algorithms for continuous control (DDPG and NAF).
MIT License
306 stars 70 forks source link
ddpg deep-deterministic-policy-gradient deep-learning pytorch reinforcement-learning

DEPRECATION

This repository is deprecated and is no longer maintaned. Please see a more recent implementation of RL for continuous control at jax-sac.

Description

Reimplementation of Continuous Deep Q-Learning with Model-based Acceleration and Continuous control with deep reinforcement learning.

Contributions are welcome. If you know how to make it more stable, don't hesitate to send a pull request.

Run

Use the default hyperparameters.

For NAF:

python main.py --algo NAF --env-name HalfCheetah-v2

For DDPG

python main.py --algo DDPG --env-name HalfCheetah-v2