Closed BY571 closed 1 day ago
Note: Links to docs will display an error until the docs builds have been completed.
As of commit acc9baf52e8eced45cefd35c75fccc4ca6d76cf7 with merge base a151923d5e1eab68a8256fa02d0946a0f57bd5d0 ():
* [Continuous Benchmark (PR) / CPU Pytest benchmark](https://hud.pytorch.org/pr/pytorch/rl/2249#27229976085) ([gh](https://github.com/pytorch/rl/actions/runs/9860746514/job/27229976085)) `Workflow failed! Resource not accessible by integration` * [Continuous Benchmark (PR) / GPU Pytest benchmark](https://hud.pytorch.org/pr/pytorch/rl/2249#27229976600) ([gh](https://github.com/pytorch/rl/actions/runs/9860746514/job/27229976600)) `Workflow failed! Resource not accessible by integration` * [Habitat Tests on Linux / tests (3.9, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2249#27229978498) ([gh](https://github.com/pytorch/rl/actions/runs/9860746541/job/27229978498)) `RuntimeError: Command docker exec -t 37973d1520f3a39f9c4b301ce2be78d94e101d27e29255fb4e6e9463c168ed64 /exec failed with exit code 139` * [Unit-tests on Linux / tests-optdeps (3.10, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2249#27229983480) ([gh](https://github.com/pytorch/rl/actions/runs/9860746535/job/27229983480)) `RuntimeError: Command docker exec -t 7cedf586ca5a98e3db66f8ca9d4dce6473d24227dac7d79bb9587ea3d04cd1fa /exec failed with exit code 1`
đŸ‘‰ Rebase onto the `viable/strict` branch to avoid these failures
* [Unit-tests on Windows / unittests-cpu / windows-job](https://hud.pytorch.org/pr/pytorch/rl/2249#27229977152) ([gh](https://github.com/pytorch/rl/actions/runs/9860746531/job/27229977152)) ([trunk failure](https://hud.pytorch.org/pytorch/rl/commit/a151923d5e1eab68a8256fa02d0946a0f57bd5d0#27170792519)) `test/test_transforms.py::TestActionDiscretizer::test_trans_parallel_env_check[False]`
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Docstrings are updated :)
Description
Adds the offline RL algorithm TD3+BC as described in the paper A Minimalist Approach to Offline Reinforcement Learning.
Performance is verified for half-cheeta medium dataset:![image](https://github.com/pytorch/rl/assets/29492081/18ab8812-ab2c-46b6-adb2-16ce8b8b2df5)
Motivation and Context
Why is this change required? What problem does it solve? If it fixes an open issue, please link to the issue here. You can use the syntax
close #15213
if this solves the issue #15213Types of changes
What types of changes does your code introduce? Remove all that do not apply:
Checklist
Go over all the following points, and put an
x
in all the boxes that apply. If you are unsure about any of these, don't hesitate to ask. We are here to help!