现代计算机2024,Vol.30Issue(5) :37-43.DOI:10.3969/j.issn.1007-1423.2024.05.006

未知环境下基于Dueling DQN的无人机路径规划研究

Research on UAV path planning based on Dueling DQN in unknown environment

赵恬恬 孔建国 梁海军 刘晨宇
现代计算机2024,Vol.30Issue(5) :37-43.DOI:10.3969/j.issn.1007-1423.2024.05.006

未知环境下基于Dueling DQN的无人机路径规划研究

Research on UAV path planning based on Dueling DQN in unknown environment

赵恬恬 1孔建国 1梁海军 1刘晨宇1
扫码查看

作者信息

  • 1. 中国民用航空飞行学院空中交通管理学院,广汉 618300
  • 折叠

摘要

为有效解决无人机在未知环境下的路径规划问题,提出一种基于Dueling DQN的路径规划方法.首先,在DQN的基础上,引入对抗网络架构,从而更好地提高成功率;其次,设计状态空间并定义离散化的动作和适当的奖励函数以引导无人机学习最优路径;最后在仿真环境中对DQN和Dueling DQN展开训练,结果表明:①Dueling DQN能规划出未知环境下从初始点到目标点的无碰撞路径,且能获得更高的奖励值;②经过50000次训练,Dueling DQN的成功率比DQN提高17.71%,碰撞率减少1.57%,超过最长步长率降低16.14%.

Abstract

In order to effectively solve the path planning problem of UAV in unknown environment,a path planning method based on Dueling DQN was proposed.Firstly,on the basis of DQN,the adversarial network architecture is introduced to better im-prove the success rate.Secondly,the state space is designed and discrete actions and appropriate reward functions are defined to guide the UAV to learn Xi optimal path.Finally,the DQN and Dueling DQN are trained in the simulation environment,and the re-sults show that:①Dueling DQN can plan the collision-free path from the initial point to the target point in the unknown environ-ment,and can obtain a higher reward value;②After 50000 training sessions,the success rate of Dueling DQN is 17.71%higher than that of DQN,the collision rate is reduced by 1.57%,and the rate of exceeding the longest step size is reduced by 16.14%.

关键词

无人机/路径规划/深度强化学习/Dueling/DQN算法

Key words

UAV/path planning/deep reinforcement learning/Dueling DQN algorithm

引用本文复制引用

基金项目

四川省科技计划资助项目(2022YFG0210)

中央高校基本科研业务费专项资金资助(PHD2023-035)

中央高校基本科研业务费专项资金资助(ZHMH2022-009)

出版年

2024
现代计算机
中大控股

现代计算机

影响因子:0.292
ISSN:1007-1423
参考文献量14
段落导航相关论文