ibpsa / project1-boptest-gym

Other
43 stars 20 forks source link

PPO2 #36

Closed javiarrobas closed 3 years ago

javiarrobas commented 3 years ago

This is to implement an example and a test with the PPO2 algorithm in the bestest_hydronic_heatpump case. This is the proximal policy optimization algorithm as described in in https://arxiv.org/abs/1707.06347, and documented in https://stable-baselines.readthedocs.io/en/master/modules/ppo2.html.