复杂环境下仿蛇机器人的路径规划策略

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：为完成多障碍物复杂环境中的侦察任务,改善仿蛇机器人的路径搜索能力和提升自主决策能力,本文中提出了一种基于先验知识推理库强化学习的路径规划控制策略.首先,建立仿蛇机器人的运动模型和交互环境模型;其次,通过建立模糊逻辑先验知识推理库(FLIS)与Soft Actor-Critic(SAC)动作网络组成的动作分层选择模型,将输出动作空间进行离散化的方法来调整运动控制精度,提供奖惩机制指导机器人与环境模型进行交互,实现机器人运动自主决策的持续生成过程.仿真结果表明:在多障碍物环境中所提出的改进算法得到的运动控制策略收敛速度和鲁棒性明显提升,降低了训练探索次数,提高了机器人对复杂环境的适应性;试验验证了训练所生成的策略模型在真实环境中的可行性.

外文标题：Path planning strategy of snake-like robot in complex environment

外文摘要：In order to complete the reconnaissance task in the complex environment of multiple obstacles,improve the path search ability of the snake-like robot and enhance the autonomous decision-making ability,this paper proposes a path planning control strategy based on prior knowledge inference library reinforcement learning.Firstly,the motion model and interactive environment model of snake-like robot are established.Secondly,by establishing an action hierarchical selection model composed of Fuzzy logic prior knowledge inference system(FLIS)and Soft Actor-Critic(SAC)action network,the output action space is discretized to adjust the motion control accuracy,and a reward and punishment mechanism is provided to guide the robot to interact with the environment model to realize the continuous generation process of robot motion autonomous decision-making.The simulation results show that the convergence speed and robustness of the motion control strategy obtained by the improved algorithm proposed in the multi-obstacle environment are significantly improved,the number of training explorations is reduced,and the adaptability of the robot to the complex environment is improved.The experiment verifies the feasibility of the strategy model generated by the training in the real environment.

外文关键词：

snake robotreinforcement learningprior knowledgepath planningautonomous obstacle a-voidance

作者：

李伟庆、王永娟、高云龙

展开 >

作者单位：

南京理工大学机械工程学院,南京 210094

关键词：

仿蛇机器人强化学习先验知识路径规划自主避障

出版年：

2024

DOI：

10.11809/bqzbgcxb2024.07.004

兵器装备工程学报

重庆市（四川省）兵工学会重庆理工大学

兵器装备工程学报

CSTPCD北大核心

影响因子：0.478

ISSN：2096-2304

年,卷(期)：2024.45(7)

参考文献量6