ugr-sail / sinergym

Gym environment for building simulation and control using reinforcement learning
https://ugr-sail.github.io/sinergym/
MIT License
127 stars 34 forks source link

(v3.3.7) - Fix normalization calibration in evaluations #420

Closed AlejandroCN7 closed 2 months ago

AlejandroCN7 commented 2 months ago

Description

This PR addresses issue #419, which works perfectly for most environments. However, for some reason, this adjustment causes an error in environments with specific combinations of buildings and climates (though the vast majority function correctly).

Despite extensive debugging efforts, the origin cause remains unidentified. In these cases, the Stable Baselines 3 agent returns nans instead of valid values when the evaluation process begins.

Therefore, an alternative method to synchronize normalization between evaluation and training has been implemented, which appears to be error-free. If any other bug arise, an issue will be created to report the problem.

Types of changes

Checklist:

Changelog: