Data normalization for a2c inputs?

hill-a / stable-baselines

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

MIT License

4.16k stars 725 forks source link

Hey, I've been experimenting with the A2C model, and I've noticed that the scaling of data plays a big role in the outcome produced. For example, I've passed the same data represented as decimals in the thousandths, and in the tenths, and gotten drastically different results. I was looking for some tips or intuitive explanations for best practices and what goes on behind the scenes, specifically what order of magnitude you've found works best and how to treat separate inputs of different orders of magnitude. Thank you. Edit: found good recourses in stable baselines docs.

hill-a / stable-baselines

Data normalization for a2c inputs? #1157