tambetm / gymexperiments

MIT License
28 stars 12 forks source link

OpenAI Gym experiments

My implementations of normalized advantage functions (NAF) for continuous actions spaces and dueling network architecture (DUEL) for discrete action spaces.

Example results with NAF:

Example results with DUEL:

Prerequisites

You will need:

In Ubuntu that would be:

sudo apt-get install python-numpy python-sklearn
pip install --user gym keras

If you want to run Mujoco environments, you also need to acquire trial key and install the binaries. Then you can install Mujoco support for OpenAI Gym:

pip install --user gym[mujoco]

Running the code

There are three main starting points:

You can override default hyperparameters with command-line options, use -h to see them or check out the code.

Some other utility scipts: