We study the performance of various deep reinforcement learning algorithms for the problem of microgrid’s energy management system. We propose a novel microgrid model that consists of a wind turbine generator, an energy storage system, a population of thermostatically controlled loads, a population of price-responsive loads, and a connection to the main grid. The proposed energy management system is designed to coordinate between the different sources of flexibility by defining the priority resources, the direct demand control signals and the electricity prices. Seven deep reinforcement learning algorithms are implemented and empirically compared in this paper. The numerical results show a significant difference between the different deep reinforcement learning algorithms in their ability to converge to optimal policies. By adding an experience replay and a second semi-deterministic training phase to the well-known Asynchronous advantage actor critic algorithm, we achieved considerably better performance and converged to superior policies in terms of energy efficiency and economic value.
I'm interested in what have you done. Could yo please help me how can I run the code .