hakuhodo-technologies / scope-rl

SCOPE-RL: A Python library for offline reinforcement learning, off-policy evaluation, and selection
https://scope-rl.readthedocs.io/en/latest/
Apache License 2.0

[Questions] how can we generate offline dataset like D4RL? #24

Open return-sleep opened 7 months ago

return-sleep commented 7 months ago

Thanks for your outstanding work. I would like to ask: how should we generate offline datasets, such as the medium or medium-expert versions in D4RL? Also, is it possible to render states as images to support learning offline policies from visual observations?

aiueola commented 7 months ago

@return-sleep

Thank you for the question. For dataset generation, please refer to this page in the documentation. Though we do not provide pre-trained agents for data collection, you can train a policy online and use it as the data collection policy.
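To illustrate the D4RL-style naming convention (a "medium" dataset comes from a partially trained policy, "medium-expert" mixes it with an expert's trajectories), here is a minimal, self-contained sketch of the collection loop. `ToyEnv`, `medium_policy`, and `expert_policy` are illustrative stand-ins, not part of SCOPE-RL's API; in practice you would use a real environment and policies saved at different stages of online training, then convert the logged transitions with SCOPE-RL's dataset utilities.

```python
import random

class ToyEnv:
    """Illustrative episodic environment: reach state 5 within 10 steps."""
    def reset(self):
        self.s, self.t = 0, 0
        return self.s

    def step(self, a):
        # Action 1 moves right, anything else moves left (floored at 0).
        self.s = max(0, self.s + (1 if a == 1 else -1))
        self.t += 1
        done = self.s >= 5 or self.t >= 10
        return self.s, float(self.s >= 5), done

def expert_policy(s):
    # Stand-in for a fully trained agent: always moves toward the goal.
    return 1

def medium_policy(s):
    # Stand-in for a partially trained agent: moves right 70% of the time.
    return 1 if random.random() < 0.7 else 0

def collect(env, policy, n_episodes):
    """Roll out a behavior policy and log (s, a, r, s', done) tuples."""
    dataset = []
    for _ in range(n_episodes):
        s = env.reset()
        done = False
        while not done:
            a = policy(s)
            s_next, r, done = env.step(a)
            dataset.append((s, a, r, s_next, done))
            s = s_next
    return dataset

random.seed(0)
env = ToyEnv()
medium = collect(env, medium_policy, 50)                  # "medium"-style data
medium_expert = medium + collect(env, expert_policy, 50)  # "medium-expert" mix
```

The same pattern applies with any environment: checkpoint the online learner partway through training to play the role of the medium-level behavior policy, then concatenate its episodes with the converged policy's episodes for the mixed dataset.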

For the rendering of visual inputs, we do not have functions specialized for visualizing image observations. However, the offline reinforcement learning module should be able to handle them by taking advantage of the PixelEncoderFactory provided by d3rlpy. For example, when training CQL, you can simply pass the instance as actor_encoder_factory=pixel_encoder_factory, critic_encoder_factory=pixel_encoder_factory, as described in d3rlpy's documentation.