This PR is a major refactor of the policy and RL code. The idea is to remove RL-algorithm-specific details (e.g. TD3, SAC, SAC-D) from `policy*.py` and move them to the `rl` directory.
With this decoupling, one can add a new RL algorithm by inheriting from the `RLAlgorithmBase` class, without changing the `policy*` classes.
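
A minimal sketch of what this decoupling could look like, not the actual code in this PR. Only the name `RLAlgorithmBase` comes from the description above; the `compute_loss` hook, the `Policy` stand-in, `MyNewAlgorithm`, and the batch layout are all hypothetical and chosen just to illustrate the idea.

```python
from abc import ABC, abstractmethod


class RLAlgorithmBase(ABC):
    """Stand-in for the base class named in this PR; the real interface may differ."""

    @abstractmethod
    def compute_loss(self, batch: dict) -> float:
        """Hypothetical hook: return the algorithm-specific loss for a batch."""


class MyNewAlgorithm(RLAlgorithmBase):
    """Hypothetical new algorithm: all algorithm-specific logic lives here."""

    def compute_loss(self, batch: dict) -> float:
        # Toy mean squared error between predicted and target values.
        errors = [(p - t) ** 2 for p, t in zip(batch["pred"], batch["target"])]
        return sum(errors) / len(errors)


class Policy:
    """Stand-in for a policy* class: it delegates to whichever algorithm it is given."""

    def __init__(self, algorithm: RLAlgorithmBase):
        self.algorithm = algorithm

    def update(self, batch: dict) -> float:
        # No TD3/SAC-specific branches here; the algorithm object owns them.
        return self.algorithm.compute_loss(batch)


# Adding a new algorithm only requires a new subclass; Policy is untouched.
policy = Policy(MyNewAlgorithm())
loss = policy.update({"pred": [1.0, 2.0], "target": [1.0, 4.0]})
```

In this sketch the policy only knows about the abstract interface, so swapping in another algorithm means passing a different `RLAlgorithmBase` subclass to the constructor.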