ibpsa / project1-boptest-gym

Other
43 stars 20 forks source link

A2C implementation #34

Closed javiarrobas closed 3 years ago

javiarrobas commented 3 years ago

This is to implement an example and a test with the A2C algorithm in the bestest_hydronic_heatpump case. This is the synchronous, deterministic variant of Asynchronous Advantage Actor Critic (A3C), as described in https://arxiv.org/pdf/1602.01783.pdf, and documented in https://stable-baselines.readthedocs.io/en/master/modules/a2c.html.