Farama-Foundation / D4RL

A collection of reference environments for offline reinforcement learning
Apache License 2.0

[Proposal] Humanoid Offline RL data #221

Open prajjwal1 opened 10 months ago

prajjwal1 commented 10 months ago

Proposal

I was wondering whether there is interest in making the Humanoid dataset part of D4RL. The dataset was introduced in the following paper:

Paper: https://arxiv.org/abs/2305.14550

Data: https://dl.fbaipublicfiles.com/prajj/rl_paradigm/humanoid_offline_rl_data.tar.gz

Repo: https://github.com/prajjwal1/rl_paradigm

Motivation

Making Humanoid available would help the offline RL community for the same reasons the existing datasets do. Humanoid is more challenging than the existing D4RL datasets in some respects, such as the dimension of the state space.

We provide medium, medium-expert, and expert data for Humanoid, all in the same format as D4RL. The dataset contains more timesteps of data than the existing D4RL datasets provide.
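As a rough sketch of what "the same format as D4RL" would imply for a consumer: D4RL's `get_dataset()` returns a dictionary of aligned arrays keyed by `observations`, `actions`, `rewards`, and `terminals`. The helper below, `check_d4rl_format`, is a hypothetical name and not part of D4RL or the linked repo; it only checks that those keys exist and have matching lengths. How the archive is laid out on disk is not described in this thread, so loading is left out, and the 376/17 sizes are the Gym Humanoid observation and action dimensions, assumed here to apply to this dataset.

```python
# Keys that a D4RL-style dataset dictionary is expected to carry.
REQUIRED_KEYS = ("observations", "actions", "rewards", "terminals")


def check_d4rl_format(dataset):
    """Return the number of timesteps if `dataset` looks D4RL-shaped.

    Hypothetical helper: works on any mapping whose values support len()
    (lists or NumPy arrays). Raises ValueError on a missing key or on
    arrays of different lengths.
    """
    missing = [key for key in REQUIRED_KEYS if key not in dataset]
    if missing:
        raise ValueError(f"missing keys: {missing}")
    lengths = {key: len(dataset[key]) for key in REQUIRED_KEYS}
    if len(set(lengths.values())) != 1:
        raise ValueError(f"inconsistent lengths: {lengths}")
    return lengths["observations"]


# Toy stand-in for two timesteps of Humanoid data (assumed sizes:
# 376-dimensional observations, 17-dimensional actions).
toy = {
    "observations": [[0.0] * 376, [0.0] * 376],
    "actions": [[0.0] * 17, [0.0] * 17],
    "rewards": [1.0, 0.5],
    "terminals": [False, True],
}
n_steps = check_d4rl_format(toy)
```

A check like this could be run once on each of the medium, medium-expert, and expert splits before handing them to an existing D4RL training pipeline.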

Cc @rodrigodelazcano

egg-west commented 9 months ago

This seems like a good idea to me. Humanoid has a much higher-dimensional observation space, which exacerbates the distributional-shift challenge in offline RL. It would also help to include medium-replay and random data, so that this dataset is consistent with the current MuJoCo locomotion datasets.
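To make the two points above concrete, here is a small sketch. The observation sizes are the standard Gym MuJoCo values for each task (assumed to match the datasets discussed here), and the tier list follows the naming of the existing D4RL MuJoCo locomotion datasets; the variable names are illustrative only.

```python
# Observation dimensions of the Gym MuJoCo locomotion tasks
# (assumption: the proposed Humanoid data uses the standard 376-dim state).
OBS_DIM = {"hopper": 11, "halfcheetah": 17, "walker2d": 17, "humanoid": 376}

# Dataset tiers offered for the existing D4RL locomotion tasks.
D4RL_LOCOMOTION_TIERS = ["random", "medium", "medium-replay", "medium-expert", "expert"]

# Tiers listed in the proposal.
PROPOSED_HUMANOID_TIERS = ["medium", "medium-expert", "expert"]

# How much larger the Humanoid observation is than the largest existing one.
largest_existing = max(dim for task, dim in OBS_DIM.items() if task != "humanoid")
ratio = OBS_DIM["humanoid"] / largest_existing

# Tiers that would still be needed for parity with the existing tasks.
missing_tiers = [t for t in D4RL_LOCOMOTION_TIERS if t not in PROPOSED_HUMANOID_TIERS]

print(f"humanoid observations are about {ratio:.0f}x larger")
print("missing tiers:", missing_tiers)
```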