
Energy Management of Extended-Range Electric Vehicles Based on LSTM Speed Prediction and Deep Deterministic Policy Gradient

To improve the energy management performance of the Range-Extended Electric Vehicle (REEV), a Long Short-Term Memory (LSTM) neural network was first used to predict vehicle speed, and the demand power over the prediction horizon was computed from the predicted speed. This demand power, together with the demand power at the current moment, was input to a Deep Deterministic Policy Gradient (DDPG) agent, which outputs the control quantity. Finally, hardware-in-the-loop simulation verified the real-time performance of the control strategy. The results show that, under the World Transient Vehicle Cycle (WTVC), the proposed LSTM-DDPG energy management strategy reduces the equivalent fuel consumption by 0.613 kg, 0.350 kg, and 0.607 kg compared with the DDPG strategy, the Deep Q-Network (DQN) strategy, and the power-following control strategy, respectively, and differs from the dynamic programming control strategy by only 0.128 kg.
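The pipeline described in the abstract (predicted speed sequence → demand power over the horizon → state input for a DDPG agent) can be sketched as follows. This is not the authors' implementation: the vehicle parameters, the standard longitudinal-dynamics demand-power formula, and the function names are illustrative assumptions, and the LSTM output is replaced by a fixed toy speed sequence.

```python
# Minimal sketch (assumed, not from the paper): turning a predicted speed
# sequence into the demand-power state vector fed to a DDPG agent.
# All vehicle parameters below are illustrative assumptions.

M = 1800.0      # vehicle mass, kg (assumed)
G = 9.81        # gravitational acceleration, m/s^2
F_R = 0.012     # rolling-resistance coefficient (assumed)
RHO = 1.2       # air density, kg/m^3
CD_A = 0.65     # drag coefficient x frontal area, m^2 (assumed)
ETA = 0.92      # driveline efficiency (assumed)
DT = 1.0        # prediction time step, s

def demand_power(v: float, a: float) -> float:
    """Demand power in kW from standard longitudinal dynamics
    at speed v (m/s) and acceleration a (m/s^2)."""
    force = M * a + M * G * F_R + 0.5 * RHO * CD_A * v ** 2
    return force * v / ETA / 1000.0

def ddpg_state(v_now: float, v_pred: list) -> list:
    """State vector: demand power at the current moment followed by
    demand power over the prediction horizon."""
    speeds = [v_now] + v_pred
    # Finite-difference accelerations; repeat the last one to match length.
    accels = [(s2 - s1) / DT for s1, s2 in zip(speeds, speeds[1:])]
    accels = accels + [accels[-1]]
    return [demand_power(v, a) for v, a in zip(speeds, accels)]

# Example: an LSTM would supply v_pred; here it is a fixed toy sequence.
state = ddpg_state(15.0, [15.5, 16.0, 16.2])
```

In the paper's scheme, this state vector would be the observation passed to the DDPG actor network, whose output (the control quantity, e.g. the range-extender power split) is then applied to the powertrain.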

Extended-range electric vehicle; Long Short-Term Memory (LSTM) neural network; Deep Reinforcement Learning (DRL); Deep Deterministic Policy Gradient (DDPG)

路来伟、赵红、徐福良、罗勇


Qingdao University, Qingdao 266071, China


National Natural Science Foundation of China; Qingdao Special Demonstration Project of Science and Technology Benefiting the People

5217523624-1-8-cspz-18-nsh

2024

汽车技术 (Automobile Technology)
China Society of Automotive Engineers; Changchun Automobile Research Institute

Indexed in: CSTPCD; Peking University Core Journals
Impact factor: 0.522
ISSN:1000-3703
Year, Volume (Issue): 2024 (8)