https://journals.sagepub.com/doi/10.1177/09544070231193187
This paper improves the economy of the extended-range electric logistics van (ERELV) platoon from two aspects of cooperative adaptive cruise control (CACC) and energy management strategy (EMS).
Based on the DMPC algorithm, an Eco-CACC controller with the optimization goals of stability, comfort and energy-saving is designed.
The multi-agent deep reinforcement learning (MADRL) algorithm is used to solve the EMS of the ERELV platoon.
DQN, DDPG and MADDPG were used as energy management strategies for extended range electric logistics van platoon.
Happy to answer any questions you have. Please email us at wg@njust.edu.cn
A single-point optimization problem is defined on each vehicle node in the platoon, and the model prediction optimization problem of all nodes needs to be solved at each optimization moment. Using the domain node information obtained by the PLF communication topology, each sub-problem is optimized to obtain the control input of the vehicle.
The picture shows the basic framework of EMS based on deep reinforcement learning (DRL).The environment model includes hybrid power system and driving environment, and the agent module contains learning algorithm. Through the interaction between agent and environment to optimize network parameters, agent will learn to output optimal actions to the environment to maximize the cumulative reward.
The state transition of ERELV has the Markov property and generates a corresponding energy consumption at each time step. Since each agent cannot know the complete environment state, the EMS of the ERELV platoon can be modeled as a partially observable POMDP. The task types applied by MADRL algorithm are divided into fully cooperative, fully competitive and hybrid types. For the EMS of the ERELV platoon, each vehicle node needs to jointly explore the optimal control behavior under different vehicle states, so it is a fully cooperative MADRL problem.