Is it possible to add only the experience from episodes longer than a specified length to the replay_buffer?

DLR-RM / stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

❓ Question

In my custom environment, I suspect that state-action pairs from very short episodes do not help training. I would therefore like to filter out the experience from those episodes so that it is never pushed into the replay_buffer. Can anyone tell me how to implement this? Thanks!

Checklist

[X] I have checked that there is no similar issue in the repo
[X] I have read the documentation
[X] If code there is, it is minimal and working
[X] If code there is, it is formatted using the markdown code blocks for both code and stack traces.
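
One possible approach, sketched below without using any stable-baselines3 API, is to stage the transitions of the current episode and forward them to the replay buffer only once the episode has ended and turned out to be longer than a threshold. Everything in this sketch is an assumption made for illustration: `ListBuffer` is a stand-in for a real replay buffer, `EpisodeLengthFilter`, `min_length`, and `feed_episode` are hypothetical names, and a single (non-vectorized) environment is assumed. Adapting the idea to stable-baselines3 would likely mean subclassing its `ReplayBuffer` and keeping one staging list per environment.

```python
from typing import Any, List, Tuple

# (obs, next_obs, action, reward, done)
Transition = Tuple[Any, Any, Any, float, bool]


class ListBuffer:
    """Stand-in replay buffer (hypothetical): stores transitions in a list."""

    def __init__(self) -> None:
        self.storage: List[Transition] = []

    def add(self, obs, next_obs, action, reward, done) -> None:
        self.storage.append((obs, next_obs, action, reward, done))


class EpisodeLengthFilter:
    """Stage transitions per episode and forward only sufficiently long episodes."""

    def __init__(self, buffer, min_length: int) -> None:
        self.buffer = buffer
        self.min_length = min_length
        self._pending: List[Transition] = []

    def add(self, obs, next_obs, action, reward, done) -> None:
        # Hold the transition back until the episode outcome is known.
        self._pending.append((obs, next_obs, action, reward, done))
        if done:
            # Keep the episode only if it is strictly longer than min_length.
            if len(self._pending) > self.min_length:
                for transition in self._pending:
                    self.buffer.add(*transition)
            # Start staging the next episode from scratch.
            self._pending = []


def feed_episode(target: EpisodeLengthFilter, length: int) -> None:
    """Push a dummy episode of the given length through the filter."""
    for t in range(length):
        target.add(t, t + 1, 0, 0.0, t == length - 1)


buffer = ListBuffer()
filtered = EpisodeLengthFilter(buffer, min_length=3)
feed_episode(filtered, 2)  # too short: discarded
feed_episode(filtered, 5)  # long enough: all 5 transitions are stored
```

A consequence of this design is that transitions reach the buffer only at episode end, so an off-policy learner sampling during a long episode will not yet see that episode's data.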

Is it possible to add only the experience from episodes longer than a specified length to the replay_buffer? #1999