Open tbskrpmnns opened 2 years ago
Hey,
Is there a way to use your implementation with a fixed MDP dataset instead of an environment, for fully offline RL?