DQN rlax + bsuite vs rlax + gymnax

RobertTLange / gymnax

RL Environments in JAX 🌍

Apache License 2.0

585 stars 54 forks

DQN rlax + bsuite vs rlax + gymnax #3

Closed RobertTLange closed 3 years ago

RobertTLange commented 3 years ago

I would like a benchmark figure comparing the rlax DQN example (which uses bsuite) against a gymnax-accelerated version. Ideally, it should compare the runtime of step transitions on different devices.

At the moment there is something wrong with the optimisation and/or evaluation. Figure out the bug :bug:.

The agents should all be in an experimental directory.
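A minimal sketch of how the step-transition runtime comparison could be measured per device. It does not use the gymnax or bsuite APIs: `toy_step` is a hypothetical stand-in for an environment step, and `rollout` and `time_rollout` are illustrative names that come from neither library. The only assumption is that the real step function can be jitted and scanned in the same way.

```python
import time

import jax
import jax.numpy as jnp


def toy_step(state, action):
    """Hypothetical stand-in for an environment step (not the gymnax API)."""
    position, velocity = state
    velocity = 0.99 * velocity + 0.01 * action
    position = position + velocity
    reward = -jnp.abs(position)
    return (position, velocity), reward


def rollout(init_state, actions):
    """Apply one step per action with lax.scan so the whole loop is compiled."""
    return jax.lax.scan(toy_step, init_state, actions)


def time_rollout(device, num_steps=10_000, num_repeats=5):
    """Return the best-of-N mean wall-clock time per step on `device`, in seconds."""
    actions = jax.device_put(jnp.ones(num_steps), device)
    init_state = jax.device_put((jnp.zeros(()), jnp.zeros(())), device)
    jitted = jax.jit(rollout)

    # Warm-up call: triggers compilation so it is excluded from the timing.
    _, rewards = jitted(init_state, actions)
    rewards.block_until_ready()

    timings = []
    for _ in range(num_repeats):
        start = time.perf_counter()
        _, rewards = jitted(init_state, actions)
        # JAX dispatches asynchronously; wait for the result before stopping the clock.
        rewards.block_until_ready()
        timings.append(time.perf_counter() - start)
    return min(timings) / num_steps


if __name__ == "__main__":
    # Time the same rollout on every device JAX can see (CPU, GPU, TPU).
    for device in jax.devices():
        per_step = time_rollout(device)
        print(f"{device}: {per_step * 1e6:.3f} microseconds per step")
```

The warm-up call and `block_until_ready()` matter here: without them the measurement would include compilation time or stop before the computation has finished.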

RobertTLange commented 3 years ago

Alternatively or additionally, we can simply drop gymnax into the Anakin Catch example.
Also add the CMA-ES example for Pendulum-v0 as a notebook!

RobertTLange commented 3 years ago

Addressed in d7e262b3c395e3b35cdd49aefdc60467a40a8a9b and 9b92dbed7c05da366fbcd4e40111baa477951d2d.