younggyoseo / pytorch-acer

PyTorch implementation of Sample Efficient Actor-Critic with Experience Replay(ACER)
16 stars 5 forks source link

pytorch-acer

An Implementation of Sample Efficient Actor-Critic with Experience Replay.

The paper proposes an off-policy Actor Critic algorithm with Experience Replay(ACER) in order to increase sample efficiency of actor critic algorithm.

This repository is based on pytorch-a2c-ppo-acktr and baselines/acer. A lot of codes are borrowed from them. If there are any license/property problems, please contact me.

A few notes:

Requirements

pytorch 0.4.1
numpy
gym
baselines
python 3.6

You can install gym and baselines for atari games:

git clone https://github.com/openai/baselines.git
cd baselines
pip install -e .
pip install 'atari[gym]'

Examples

python main.py --env 'PongNoFrameskip-v4' --num-processes 4 --recurrent-policy

PongNoFrameskip-v4

python main.py --env 'BreakoutNoFrameskip-v4' --num-processes 16

BreakoutNoFrameskip-v4

Acknowledgements