基于深度Q网络的无人车侦察路径规划

扫码查看

原文链接

万方数据
维普

中文摘要：在城市战场环境下,无人侦察车有助于指挥部更好地了解目标地区情况,提升决策准确性,降低军事行动的威胁.目前,无人侦察车多采用阿克曼转向结构,传统算法规划的路径不符合无人侦察车的运动学模型.对此,将自行车运动模型和深度Q网络相结合,通过端到端的方式生成无人侦察车的运动轨迹.针对深度Q网络学习速度慢、泛化能力差的问题,根据神经网络的训练特点提出基于经验分类的深度Q网络,并提出具有一定泛化能力的状态空间.仿真实验结果表明,相较于传统路径规划算法,所提算法规划出的路径更符合无人侦察车的运动轨迹并提升无人侦察车的学习效率和泛化能力.

外文标题：Path planning for unmanned vehicle reconnaissance based on deep Q-network

外文摘要：In urban battlefield environments,unmanned reconnaissance vehicles help command centers better understand the situation in target areas,enhance decision-making accuracy,and reduce the threat of military operations.At present,unmanned reconnaissance vehicles mostly use Ackermann steering geometry.The path planned by the traditional algorithms does not conform to the kinematic model of the unmanned reconnaissance vehicle.Thus,the combination of bicycle motion model and deep Q-network are proposed to generate the motion trajectory of unmanned reconnaissance vehicles in an end-to-end manner.In order to solve the problems of slow learning speed and poor generalizing of deep Q-network,a deep Q-network based on experience classification according to the training characteristics of neural network and a state space with certain generalization ability are proposed.The simulation experiment results show that compared with the traditional path planning algorithms,the path planned by proposed algorithm is more in line with the movement trajectory of the unmanned reconnaissance vehicle,and which improve the learning efficiency and generalization ability of the unmanned reconnaissance vehicle.

外文关键词：

deep reinforcement learningunmanned reconnaissance vehiclepath planningdeep Q-network

作者：

夏雨奇、黄炎焱、陈恰

展开 >

作者单位：

南京理工大学自动化学院,江苏南京 210094

关键词：

深度强化学习无人侦察车路径规划深度Q网络

基金：

国家自然科学基金

项目编号：

61374186

出版年：

2024

DOI：

10.12305/j.issn.1001-506X.2024.09.19

系统工程与电子技术

中国航天科工防御技术研究院中国宇航学会中国系统工程学会

系统工程与电子技术

CSTPCD北大核心

影响因子：0.847

ISSN：1001-506X

年,卷(期)：2024.46(9)