Stable-Baselines-Team / stable-baselines3-contrib

Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
https://sb3-contrib.readthedocs.io
MIT License
465 stars, 173 forks

Add rollout_buffer_class to TRPO #214

Closed (ernestum closed this 11 months ago)

ernestum commented 11 months ago

Description

This adds the rollout_buffer_class parameter to TRPO, in sync with https://github.com/DLR-RM/stable-baselines3/pull/1720. I decided not to add it to MaskablePPO and RecurrentPPO, since they require special rollout buffers.
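For readers unfamiliar with the pattern, here is a minimal, library-free sketch of what a rollout_buffer_class parameter does: the algorithm takes a buffer class (not an instance) and instantiates it itself, falling back to a default when none is given. All names below (ToyRolloutBuffer, ClippedRewardBuffer, ToyOnPolicyAlgorithm) are hypothetical stand-ins for illustration, not the actual SB3 or sb3-contrib code.

```python
from typing import Optional, Type


class ToyRolloutBuffer:
    """Hypothetical stand-in for a rollout buffer (not SB3's RolloutBuffer)."""

    def __init__(self, buffer_size: int) -> None:
        self.buffer_size = buffer_size
        self.rewards = []

    def add(self, reward: float) -> None:
        # Refuse to store more transitions than the buffer was sized for.
        if len(self.rewards) >= self.buffer_size:
            raise ValueError("buffer is full")
        self.rewards.append(reward)


class ClippedRewardBuffer(ToyRolloutBuffer):
    """Custom buffer that clips rewards to [-1, 1] before storing them."""

    def add(self, reward: float) -> None:
        super().add(max(-1.0, min(1.0, reward)))


class ToyOnPolicyAlgorithm:
    """Hypothetical algorithm that accepts the buffer class as a parameter."""

    def __init__(
        self,
        n_steps: int,
        rollout_buffer_class: Optional[Type[ToyRolloutBuffer]] = None,
    ) -> None:
        # Fall back to the default buffer when no class is passed.
        self.rollout_buffer_class = rollout_buffer_class or ToyRolloutBuffer
        # The algorithm, not the caller, creates the buffer instance.
        self.rollout_buffer = self.rollout_buffer_class(n_steps)


default_algo = ToyOnPolicyAlgorithm(n_steps=4)
custom_algo = ToyOnPolicyAlgorithm(n_steps=4, rollout_buffer_class=ClippedRewardBuffer)
custom_algo.rollout_buffer.add(5.0)   # stored as 1.0
custom_algo.rollout_buffer.add(-3.0)  # stored as -1.0
```

This also suggests why the parameter is a poorer fit for algorithms that depend on a specialized buffer: if the training loop relies on extra fields or methods that only the specialized buffer provides, swapping in an arbitrary class would break it.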

Checklist:

Note: we are using a maximum length of 127 characters per line

araffin commented 11 months ago

FYI, I already published a pre-release, https://pypi.org/project/stable-baselines3/2.2.0a9/, and you will need it to build this SB3 contrib version ;)

ernestum commented 11 months ago

Oh nice, thanks!