hill-a / stable-baselines

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
http://stable-baselines.readthedocs.io/
MIT License
4.16k stars 725 forks source link

Data normalization for a2c inputs? #1157

Closed DaniilKardava closed 2 years ago

DaniilKardava commented 2 years ago

Hey, I've been experimenting with the A2C model, and I've noticed that the scaling of data plays a big role in the outcome produced. For example, I've passed the same data represented as decimals in the thousandths, and in the tenths, and gotten drastically different results. I was looking for some tips or intuitive explanations for best practices and what goes on behind the scenes, specifically what order of magnitude you've found works best and how to treat separate inputs of different orders of magnitude. Thank you. Edit: found good recourses in stable baselines docs.

Miffyli commented 2 years ago

Yup, as you noticed, docs have the most resources we have to offer :). Also we recommend using stable-baselines3, which has better docs and maintained code-base.

Closing as resolved.