Closed javiarrobas closed 3 years ago
This is to implement an example and a test with the A2C algorithm in the bestest_hydronic_heatpump case. This is the synchronous, deterministic variant of Asynchronous Advantage Actor Critic (A3C), as described in https://arxiv.org/pdf/1602.01783.pdf, and documented in https://stable-baselines.readthedocs.io/en/master/modules/a2c.html.
This is to implement an example and a test with the A2C algorithm in the bestest_hydronic_heatpump case. This is the synchronous, deterministic variant of Asynchronous Advantage Actor Critic (A3C), as described in https://arxiv.org/pdf/1602.01783.pdf, and documented in https://stable-baselines.readthedocs.io/en/master/modules/a2c.html.