ugr-sail / sinergym

Gym environment for building simulation and control using reinforcement learning
https://ugr-sail.github.io/sinergym/
MIT License
131 stars 35 forks source link

(v3.2.1) - Sinergym reward function improvement; new reward terms and metrics in CSV Logger and Callbacks. #388

Closed AlejandroCN7 closed 8 months ago

AlejandroCN7 commented 8 months ago

Description

This update enhances the modularization of the reward calculation process, introducing additional terms to the reward and info dictionaries returned by the environment.

Additionally, CSVLogger names have been refined, and these new metrics are now included. Corresponding adjustments have been made to the training and evaluation logging callbacks for DRL algorithms.

In essence, the reward now distinguishes more effectively between absolute values of energy and comfort violation, their respective absolute penalties, and the weighted terms summed in the reward. This enables better adaptation and facilitates the creation of new rewards inheriting from it.

Reward section has been improved in documentation, with new diagrams.

Types of changes

Checklist:

Changelog: