基于深度强化学习的船舶路径规划方法研究

扫码查看

原文链接

万方数据
维普

中文摘要：针对现有路径规划算法在面对复杂环境时需要大量先验信息,并存在计算量大、转折过多、搜索精准度差等问题,使用深度强化学习算法可以弥补上述缺陷,但是存在算法本身收敛慢等问题.针对此问题,提出使用改进人工势场法(APF)对深度强化学习算法的奖励函数进行优化处理,并通过贝塞尔曲线对路径进行平滑处理,最终输出相对平滑的船舶航行路径.在相同环境下,将改进算法模型与现有方法的路径规划效果进行比较分析,结果表明,DQN-APF算法在生成的路径长度、平滑度、规划完成时间等船舶路径综合规划参数能力上得到了提升.

外文标题：Research on ship path planning method based on deep reinforcement learning

外文摘要：The existing path planning algorithm needs a large amount of prior information in the face of complex environment,and has problems such as large amount of computation,excessive transition and poor search accuracy.The use of deep reinforcement learning algorithm can make up the above defects,but the algorithm itself convergence is slow and other problems.To solve this problem,an improved Artificial Po-tential Field method(APF)is proposed to optimize the reward function of the deep reinforcement learning algorithm,and the path is smoothed by Bessel curve.Finally,a relatively smooth sailing path is output.Under the same environment,the effect of the improved algorithm model and the existing method of path planning is compared and analyzed.The results show that the DQN-APF algorithm has improved the ability of the generated path length,smoothness,planning completion time and other ship path comprehensive planning parameters.

外文关键词：

ship path planningdeep reinforcement learningimproved artificial potential fieldreward functionpath smoothing

作者：

杨长兵、张海华、刘焕牢

展开 >

作者单位：

南方海洋科学与工程广东省实验室(湛江),广东湛江 524000

广东海洋大学机械工程学院,广东湛江 524000

关键词：

船舶路径规划深度强化学习改进人工势场奖励函数路径平滑

基金：

广东省科研事业单位重点领域研发计划

项目编号：

2020B1111-500001

出版年：

2024

DOI：

10.13274/j.cnki.hdzj.2024.10.019

信息技术

黑龙江省信息技术学会中国电子信息产业发展研究院　中国信息产业部电子信息中心

信息技术

CSTPCD

影响因子：0.413

ISSN：1009-2552

年,卷(期)：2024.(10)