基于深度Q网络的机器人路径规划研究综述

扫码查看

原文链接

万方数据
维普

中文摘要：随着深度强化学习的不断发展,深度Q网络(DQN)在机器人路径规划中得到广泛关注和研究.首先,简要介绍DQN以及Nature DQN、Double DQN、Dueling DQN和D3QN等算法的基本原理和改进思想.针对算法存在的样本获取成本高和交互效率低的问题,系统梳理并总结了从奖励函数、探索能力、样本利用率等方面进行优化的研究成果和思路.最后,讨论了DQN在现代物流中进行机器人路径规划的优势,对每个场景提出了算法的优化方向,涵盖状态空间、动作空间以及奖励函数等多个关键方面.

外文标题：Research review of robot path planning based on DQN

外文摘要：With the continuous development of deep reinforcement learning,deep Q-learning network(DQN)has received extensive attention and research in robot path planning.Firstly,the basic principles and improvement ideas of DQN and algorithms such as Nature DQN,Double DQN,Dueling DQN and D3QN is briefly introduced.In view of the problems of high sample acquisition cost and low interaction efficiency in the algorithm,the research results and ideas of optimization from reward function,exploration ability,sample utilization rate,etc are systematically sorted and summarized.Finally,the advantages of DQN in robot path planning in modern logistics is discussed,and optimization directions for each scenario is proposed covering key aspects such as state space,action space,and reward function.

外文关键词：

robotpath planningdeep Q-learning network(DQN)modern logistics

作者：

卢锦澎、梁宏斌

展开 >

作者单位：

西南交通大学交通运输与物流学院,四川成都541004

关键词：

机器人路径规划深度Q网络现代物流

基金：

国家自然科学基金面上项目

项目编号：

62071398

出版年：

2024

DOI：

10.13873/J.1000-9787(2024)06-0001-05

传感器与微系统

中国电子科技集团公司第四十九研究所

传感器与微系统

CSTPCD北大核心

影响因子：0.61

ISSN：1000-9787

年,卷(期)：2024.43(6)

参考文献量45