Real-time UAV path planning based on LSTM network

扫码查看

原文链接

NETL
NSTL
万方数据
维普

外文摘要：To address the shortcomings of single-step decision making in the existing deep reinforcement learning based unmanned aerial vehicle(UAV)real-time path planning problem,a real-time UAV path planning algorithm based on long short-term memory(RPP-LSTM)network is proposed,which com-bines the memory characteristics of recurrent neural network(RNN)and the deep reinforcement learning algorithm.LSTM net-works are used in this algorithm as Q-value networks for the deep Q network(DQN)algorithm,which makes the decision of the Q-value network has some memory.Thanks to LSTM net-work,the Q-value network can use the previous environmental information and action information which effectively avoids the problem of single-step decision considering only the current environment.Besides,the algorithm proposes a hierarchical reward and punishment function for the specific problem of UAV real-time path planning,so that the UAV can more reasonably perform path planning.Simulation verification shows that com-pared with the traditional feed-forward neural network(FNN)based UAV autonomous path planning algorithm,the RPP-LSTM proposed in this paper can adapt to more complex environ-ments and has significantly improved robustness and accuracy when performing UAV real-time path planning.

外文关键词：

deep Q networkpath planningneural networkunmanned aerial vehicle(UAV)long short-term memory(LSTM)

作者：

ZHANG Jiandong、GUO Yukun、ZHENG Lihui、YANG Qiming、SHI Guoqing、WU Yong

展开 >

作者单位：

School of Electronics and Information,Northwestern Polytechnical University,Xi'an 710129,China

The Flight Automatic Control Research Institute of AVIC,Xi'an 710065,China

Military Representative Office of Marine Wuhan Bureau in Luoyang Area,Luoyang 471000,China

基金：

陕西省自然科学基金

项目编号：

2022JQ-593

出版年：

2024

DOI：

10.23919/JSEE.2023.000157

系统工程与电子技术(英文版)

中国航天科工防御技术研究院中国宇航学会中国系统工程学会中国系统仿真学会

系统工程与电子技术(英文版)

CSTPCD

影响因子：0.64

ISSN：1004-4132

年,卷(期)：2024.35(2)

参考文献量31