DLR-RM / stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
https://stable-baselines3.readthedocs.io
MIT License
8.35k stars 1.6k forks source link

proposed fix for RunningMeanStd overflow #1954

Open spiglerg opened 1 week ago

spiglerg commented 1 week ago

Connected to Issue https://github.com/DLR-RM/stable-baselines3/issues/1953

Description

RunningMeanStd is made robust to overflows with two modifications:

Motivation and Context

Types of changes

Checklist