首页|基于深度Q网络的无人车侦察路径规划

基于深度Q网络的无人车侦察路径规划

扫码查看
在城市战场环境下,无人侦察车有助于指挥部更好地了解目标地区情况,提升决策准确性,降低军事行动的威胁。目前,无人侦察车多采用阿克曼转向结构,传统算法规划的路径不符合无人侦察车的运动学模型。对此,将自行车运动模型和深度Q网络相结合,通过端到端的方式生成无人侦察车的运动轨迹。针对深度Q网络学习速度慢、泛化能力差的问题,根据神经网络的训练特点提出基于经验分类的深度Q网络,并提出具有一定泛化能力的状态空间。仿真实验结果表明,相较于传统路径规划算法,所提算法规划出的路径更符合无人侦察车的运动轨迹并提升无人侦察车的学习效率和泛化能力。
Path planning for unmanned vehicle reconnaissance based on deep Q-network
In urban battlefield environments,unmanned reconnaissance vehicles help command centers better understand the situation in target areas,enhance decision-making accuracy,and reduce the threat of military operations.At present,unmanned reconnaissance vehicles mostly use Ackermann steering geometry.The path planned by the traditional algorithms does not conform to the kinematic model of the unmanned reconnaissance vehicle.Thus,the combination of bicycle motion model and deep Q-network are proposed to generate the motion trajectory of unmanned reconnaissance vehicles in an end-to-end manner.In order to solve the problems of slow learning speed and poor generalizing of deep Q-network,a deep Q-network based on experience classification according to the training characteristics of neural network and a state space with certain generalization ability are proposed.The simulation experiment results show that compared with the traditional path planning algorithms,the path planned by proposed algorithm is more in line with the movement trajectory of the unmanned reconnaissance vehicle,and which improve the learning efficiency and generalization ability of the unmanned reconnaissance vehicle.

deep reinforcement learningunmanned reconnaissance vehiclepath planningdeep Q-network

夏雨奇、黄炎焱、陈恰

展开 >

南京理工大学自动化学院,江苏南京 210094

深度强化学习 无人侦察车 路径规划 深度Q网络

国家自然科学基金

61374186

2024

系统工程与电子技术
中国航天科工防御技术研究院 中国宇航学会 中国系统工程学会

系统工程与电子技术

CSTPCD北大核心
影响因子:0.847
ISSN:1001-506X
年,卷(期):2024.46(9)