Given the incredible performance of the DDPG + JAX prototype (https://github.com/vwxyzjn/cleanrl/pull/187), it's worth prototyping JAX with other algorithms as well! This issue tracks the overall progress of integrating JAX with CleanRL.
In CleanRL a filediff is incredibly helpful. For example, if I want to learn how TD3 is different from DDPG, I could do
open VS code and select ddpg_continuous_action.py and td3_continuous_action.py
right-click and left-click "compare selected"
the following file diff window shows up
Contribution process
There is a contribution checklist to help streamline the contribution process. For each new contribution, we'd need to add documentation, tests, run benchmark experiments, etc. See https://github.com/vwxyzjn/cleanrl/pull/186 as an example.
Problem Description
Given the incredible performance of the DDPG + JAX prototype (https://github.com/vwxyzjn/cleanrl/pull/187), it's worth prototyping JAX with other algorithms as well! This issue tracks the overall progress of integrating JAX with CleanRL.
Useful resources
Common gotchas and errors:
Useful pattern when extending
In CleanRL a filediff is incredibly helpful. For example, if I want to learn how TD3 is different from DDPG, I could do
ddpg_continuous_action.py
andtd3_continuous_action.py
Contribution process
There is a contribution checklist to help streamline the contribution process. For each new contribution, we'd need to add documentation, tests, run benchmark experiments, etc. See https://github.com/vwxyzjn/cleanrl/pull/186 as an example.
Tracked issues