mansicer / baselax

Baselax (Baselines + JAX) provides stable-baselines-style implementations of reinforcement learning (RL) algorithms with Google JAX framework.
https://baselax.readthedocs.io
Apache License 2.0
4 stars 1 forks source link

Feature: abstract agent design and an example DQN implementation #7

Closed mansicer closed 2 years ago

mansicer commented 2 years ago

This PR proposes an abstract BaseAgent design and an implementation of DQN based on this design. More details will be revealed with docstring format soon. Currently the TODO list contains:

mansicer commented 2 years ago

Some minor updates to file structures and dependencies:

mansicer commented 2 years ago

As current agent-design branch contains many changes to the core package (dependencies, versions, etc.). We decide to merge this PR and leave support of the rest of features support as further developments.