一种面向动态场景的无人艇路径规划方法

扫码查看

原文链接

万方数据
维普

中文摘要：文中提出了一种融合时空机制的改进Q学习算法(itegrated spatio-temporal mechanism Q-learning,ISTM).根据动态场景中障碍物时空变化特性,建立动、静态障碍物混合的动态环境;通过引入时空机制构建动态奖惩函数,根据障碍物动态变化情况展开路径搜索,提高无人艇对障碍物状态变化的感知能力;建立动态探索机制,通过自适应调整贪婪因子,提升无人艇在动态场景中的探索效率.结果表明:基于ISTM算法的路径规划收敛时间更短,稳定性更高,所规划出的路径更短.

外文标题：A Path Planning Method for Unmanned Surface Vehicles Oriented to Dynamic Scenes

外文摘要：An improved Q-learning algorithm(ISTM)integrating spatio-temporal mechanism was pro-posed.According to the temporal and spatial variation characteristics of obstacles in the dynamic scene,a dynamic environment with mixed dynamic and static obstacles was established.The dynamic reward and punishment function was constructed by introducing the space-time mechanism,and the path search was carried out according to the dynamic change of obstacles,so as to improve the percep-tion ability of unmanned boats to the change of obstacle state.The dynamic exploration mechanism was established,and the greedy factor was adjusted adaptively to improve the exploration efficiency of unmanned boats in dynamic scenes.The results show that the path planning based on ISTM algorithm has shorter convergence time,higher stability and shorter planned path.

外文关键词：

path planningreinforcement learningdynamic environmentspatiotemporal mechanismunmanned surface vehicle

作者：

何正伟、徐小本、汪成立

展开 >

作者单位：

武汉理工大学航运学院武汉 430063

浙江省交通运输科学研究院杭州 311305

关键词：

路径规划强化学习动态环境时空机制无人艇

基金：

浙江省科学技术厅重点研究计划浙江省科学技术厅重点研究计划湖北省重点研究计划项目

项目编号：

2021C01010ZJJKY2021-DY-0162023BAB013

出版年：

2024

DOI：

10.3963/j.issn.2095-3844.2024.02.035

武汉理工大学学报(交通科学与工程版)

武汉理工大学

武汉理工大学学报(交通科学与工程版)

CSTPCD

影响因子：0.462

ISSN：2095-3844

年,卷(期)：2024.48(2)

参考文献量2