twni2016 / pomdp-baselines

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022
https://sites.google.com/view/pomdp-baselines
MIT License
300 stars 40 forks source link

Refactor the code on RL #14

Closed twni2016 closed 2 years ago

twni2016 commented 2 years ago

This PR is a major code refactor on policy and RL part. The idea is to remove RL algorithm specific details from policy*.py, e.g. TD3, SAC, SAC-D details. These details are moved to rl directory.

With such decoupling, one can create a new RL algorithm inheriting the RLAlgorithmBase class, without changing the policy* class.