I would like to have a benchmark figure comparing the DQN example in rlax with a gymnax sped up version. Ideally, I want to compare the runtime for step transitions on different devices.
At the moment there is something wrong with the optimisation and/or evaluation. Figure out the bug :bug:.
The agents should all be in an experimental directory.
I would like to have a benchmark figure comparing the DQN example in
rlax
with agymnax
sped up version. Ideally, I want to compare the runtime for step transitions on different devices.At the moment there is something wrong with the optimisation and/or evaluation. Figure out the bug :bug:.
The
agents
should all be in anexperimental
directory.