ugr-sail / sinergym

Gym environment for building simulation and control using reinforcement learning
https://ugr-sail.github.io/sinergym/
MIT License

(v3.3.1) - Sinergym observation normalization improved #408

Closed AlejandroCN7 closed 6 months ago

AlejandroCN7 commented 6 months ago

Description

The normalization wrapper in Sinergym, which is based on the default Gymnasium wrapper, has a functional flaw.

When a model is trained with this normalization, the calibration (the running means and deviations of the observations) is lost once training ends. Consequently, when the model is later loaded for evaluation, the normalization calibration is missing and the agent performs poorly.

Motivation and Context

This PR records the means and deviations updated during training, and offers the ability to load them, either in the constructor or via a method, for use in evaluation tasks. Evaluation is configurable: updating the normalization calibration can be disabled during that process.
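As a rough illustration of the idea (not the code in this PR), the sketch below keeps a running mean and variance per observation feature, exports them as a plain dictionary, and reloads them with updates disabled for evaluation. The class `RunningNormalizer`, its `state()` and `from_state()` methods, and the `update` flag are hypothetical names chosen for this example; they are not Sinergym's or Gymnasium's API.

```python
import json
import math


class RunningNormalizer:
    """Hypothetical helper (not Sinergym's API): per-feature running mean/variance."""

    def __init__(self, size, mean=None, var=None, count=0, update=True, epsilon=1e-8):
        # Calibration can be passed in the constructor, e.g. values saved after training.
        self.mean = list(mean) if mean is not None else [0.0] * size
        self.var = list(var) if var is not None else [1.0] * size
        self.count = count
        self.update = update      # set to False to freeze calibration during evaluation
        self.epsilon = epsilon    # avoids division by zero for constant features

    def __call__(self, obs):
        if self.update:
            self._update(obs)
        return [
            (x - m) / math.sqrt(v + self.epsilon)
            for x, m, v in zip(obs, self.mean, self.var)
        ]

    def _update(self, obs):
        # Welford's online update of the mean and the population variance.
        self.count += 1
        for i, x in enumerate(obs):
            delta = x - self.mean[i]
            self.mean[i] += delta / self.count
            m2 = self.var[i] * (self.count - 1) + delta * (x - self.mean[i])
            self.var[i] = m2 / self.count

    def state(self):
        # Plain dict so the calibration can be stored next to the trained model.
        return {"mean": self.mean, "var": self.var, "count": self.count}

    @classmethod
    def from_state(cls, state, update=False):
        return cls(
            size=len(state["mean"]),
            mean=state["mean"],
            var=state["var"],
            count=state["count"],
            update=update,
        )


# Training phase: calibration is updated on every observation.
train_norm = RunningNormalizer(size=2)
for observation in ([20.0, 0.5], [22.0, 0.7], [24.0, 0.9]):
    train_norm(observation)

# Persist the calibration (here as a JSON string) and restore it for evaluation.
saved = json.dumps(train_norm.state())
eval_norm = RunningNormalizer.from_state(json.loads(saved), update=False)
normalized = eval_norm([21.0, 0.6])
```

With `update=False`, the evaluation normalizer applies the training statistics unchanged, so the agent sees observations on the same scale it was trained on.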

Why is this change required? What problem does it solve? Please, reference issue or issues opened previously.

Fixes #407
