基于强化学习经验优先提取的汽车纵向多态控制

Longitudinal polymorphic control of vehicle based on reinforcement learning with prioritized experience extraction

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：文章提出一种引入经验优先提取(prioritized experience extraction,PEE)规则的深度Q网络(deep Q network,DQN)算法,用于解决汽车纵向行驶时的多态控制问题.首先,建立车辆纵向力矩传递模型和强化学习算法模型,在进行算法移植以及制定奖励函数时综合考虑车速、距离等相关因素的综合限制;然后,通过仿真与硬件在环实验验证强化学习算法在汽车纵向多态控制方面的有效性;最后,引入PEE规则提高常规DQN算法的计算效率,解决算法区域性过拟合问题.PEE规则的引入有助于平滑主车的跟随车速,与相对距离相配合提升了行驶时的舒适性与安全性.

外文摘要：In order to solve the polymorphic control problem of the longitudinal motion of vehicle,a deep Q network(DQN)algorithm based on the rule of prioritized experience extraction(PEE)was put forward.The vehicle longitudinal torque transfer model and reinforcement learning algorithm model were analyzed and established.The transplantation of the algorithm and the formulation of reward function were made taking the comprehensive limitations of the relevant factors such as speed and dis-tance into consideration.Through the simulation and hardware-in-the-loop experiment,the effective-ness of deep reinforcement learning algorithm in the longitudinal polymorphic control of vehicle is ver-ified.In addition,PEE rule was introduced to improve the computational efficiency of conventional DQN algorithm and solve the overfitting problem to some extent.The PEE rule also realizes the smooth following speed of the main vehicle,which,in combination with the relative distance,im-proves the comfort and safety during driving.

外文关键词：

deep reinforcement learninglongitudinal controlpolymorphic controlprioritized experi-ence extraction(PEE)rule

作者：

黄鹤、付梦园、吴润晨、黄泽辰、曾琦、石琴

展开 >

作者单位：

合肥工业大学汽车与交通工程学院,安徽合肥 230009

合肥工业大学智能制造技术研究院,安徽合肥 230009

安徽省智慧交通车路协同工程研究中心,安徽合肥 230009

关键词：

深度强化学习纵向控制多态控制经验优先提取(PEE)规则

基金：

国家自然科学基金长三角科技创新共同体联合攻关资助项目(2023)安徽省新能源汽车暨智能网联汽车创新工程项目合工大智能院"科技成果转化及产业化"资助项目(2019)

项目编号：

719710732023CSJGG1600GXXT-2020-076IMICZ2019005

出版年：

2024

DOI：

10.3969/j.issn.1003-5060.2024.05.001

合肥工业大学学报(自然科学版)

合肥工业大学

合肥工业大学学报(自然科学版)

CSTPCD北大核心

影响因子：0.608

ISSN：1003-5060

年,卷(期)：2024.47(5)