pytorch / rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
https://pytorch.org/rl
MIT License

[Algorithm] TD3+BC #2249

Closed. BY571 closed this 1 day ago

BY571 commented 2 weeks ago

Description

Adds the offline RL algorithm TD3+BC as described in the paper A Minimalist Approach to Offline Reinforcement Learning.

Performance is verified on the HalfCheetah medium dataset (plot attached).
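For readers unfamiliar with the method, here is a minimal, framework-free sketch of the TD3+BC actor objective from the paper: the usual TD3 policy term scaled by a normalizer λ = α / mean|Q|, plus a behavior-cloning mean squared error against the dataset actions. It is not the code added in this PR. The function name `td3_bc_actor_loss` and its arguments are hypothetical, and it assumes the Q-values have already been computed by a critic and are not all zero.

```python
def td3_bc_actor_loss(q_values, policy_actions, dataset_actions, alpha=2.5):
    """Scalar TD3+BC actor loss for one batch (illustrative sketch only).

    q_values: critic estimates Q(s, pi(s)), one float per sample.
    policy_actions, dataset_actions: one action vector (list of floats) per sample.
    alpha: trade-off hyperparameter; 2.5 is the default reported in the paper.
    """
    n = len(q_values)

    # lambda = alpha / mean|Q| keeps the RL term on a scale comparable to the
    # BC term. In an autograd framework this factor is treated as a constant.
    mean_abs_q = sum(abs(q) for q in q_values) / n
    lam = alpha / mean_abs_q

    # RL term: the actor maximizes Q, so the loss is -lambda * mean(Q).
    rl_term = -lam * sum(q_values) / n

    # BC term: mean squared error over every action dimension in the batch.
    squared_errors = [
        (p - a) ** 2
        for pi_a, data_a in zip(policy_actions, dataset_actions)
        for p, a in zip(pi_a, data_a)
    ]
    bc_term = sum(squared_errors) / len(squared_errors)

    return rl_term + bc_term


# Tiny usage example with made-up numbers.
loss = td3_bc_actor_loss(
    q_values=[1.0, 3.0],
    policy_actions=[[0.0], [1.0]],
    dataset_actions=[[0.0], [0.0]],
    alpha=2.0,
)
print(loss)
```

A larger `alpha` weights the Q-maximization term more heavily, while a smaller one pulls the policy closer to the actions in the dataset.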

pytorch-bot[bot] commented 2 weeks ago

:link: Helpful Links

:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2249

Note: Links to docs will display an error until the docs builds have been completed.

:x: 4 New Failures, 1 Unrelated Failure

As of commit acc9baf52e8eced45cefd35c75fccc4ca6d76cf7 with merge base a151923d5e1eab68a8256fa02d0946a0f57bd5d0:

NEW FAILURES - The following jobs have failed:

* [Continuous Benchmark (PR) / CPU Pytest benchmark](https://hud.pytorch.org/pr/pytorch/rl/2249#27229976085) ([gh](https://github.com/pytorch/rl/actions/runs/9860746514/job/27229976085)) `Workflow failed! Resource not accessible by integration`
* [Continuous Benchmark (PR) / GPU Pytest benchmark](https://hud.pytorch.org/pr/pytorch/rl/2249#27229976600) ([gh](https://github.com/pytorch/rl/actions/runs/9860746514/job/27229976600)) `Workflow failed! Resource not accessible by integration`
* [Habitat Tests on Linux / tests (3.9, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2249#27229978498) ([gh](https://github.com/pytorch/rl/actions/runs/9860746541/job/27229978498)) `RuntimeError: Command docker exec -t 37973d1520f3a39f9c4b301ce2be78d94e101d27e29255fb4e6e9463c168ed64 /exec failed with exit code 139`
* [Unit-tests on Linux / tests-optdeps (3.10, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2249#27229983480) ([gh](https://github.com/pytorch/rl/actions/runs/9860746535/job/27229983480)) `RuntimeError: Command docker exec -t 7cedf586ca5a98e3db66f8ca9d4dce6473d24227dac7d79bb9587ea3d04cd1fa /exec failed with exit code 1`

BROKEN TRUNK - The following job failed but was also failing on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

* [Unit-tests on Windows / unittests-cpu / windows-job](https://hud.pytorch.org/pr/pytorch/rl/2249#27229977152) ([gh](https://github.com/pytorch/rl/actions/runs/9860746531/job/27229977152)) ([trunk failure](https://hud.pytorch.org/pytorch/rl/commit/a151923d5e1eab68a8256fa02d0946a0f57bd5d0#27170792519)) `test/test_transforms.py::TestActionDiscretizer::test_trans_parallel_env_check[False]`

This comment was automatically generated by Dr. CI and updates every 15 minutes.

BY571 commented 1 day ago

Docstrings are updated :)