计算机工程与设计2024,Vol.45Issue(9) :2835-2843.DOI:10.16208/j.issn1000-7024.2024.09.037

基于深度强化学习的机械臂多模混合控制

Multi-mode hybrid control of manipulator based on deep reinforcement learning algorithm

李家乐 张建锋 李彬 刘天琅 陈检
计算机工程与设计2024,Vol.45Issue(9) :2835-2843.DOI:10.16208/j.issn1000-7024.2024.09.037

基于深度强化学习的机械臂多模混合控制

Multi-mode hybrid control of manipulator based on deep reinforcement learning algorithm

李家乐 1张建锋 1李彬 1刘天琅 1陈检1
扫码查看

作者信息

  • 1. 西北农林科技大学信息工程学院,陕西杨凌 712100
  • 折叠

摘要

针对基于深度强化学习控制的机械臂动态避障能力不足,在作业过程中存在多任务冲突问题,提出一种基于双重角色和正则化批评者算法(DARC)的多模混合控制方法.将任务分解为多段避障模式,借助人工势场法的斥力引力思想设计奖励函数并分别进行训练;将经过初步训练的多个模式以距离阈值或奖励积累阈值进行切换控制,消除混合控制存在的冲突;结合机械臂单元运动学特点设计具有软体性质的多连杆机械臂平面仿真模型.实验验证所提方法能够有效提升机械臂动态避障能力,避免多任务间的冲突.

Abstract

To address the problems of insufficient dynamic obstacle avoidance capability of robotic arm control based on deep rein-forcement learning,and the multi-task conflict that exists in the operation process,a multi-modal hybrid control method based on dual-role and regularized critic algorithm(DARC)was proposed.The task was decomposed into multiple obstacle avoidance modes,and the reward function was designed by using the repulsive-attractive idea of artificial potential field method and was trained separately.The multiple modes that had been preliminarily trained were switched by distance threshold or reward accu-mulation threshold to eliminate the conflict existing in hybrid control.A planar simulation model of multi-link robotic arm with soft body characteristics was designed by combining the kinematic characteristics of robotic arm unit.The experiment verifies that the proposed method can effectively improve the dynamic obstacle avoidance capability of robotic arm and avoid the conflict between multiple tasks.

关键词

深度强化学习/机械臂/双重演员正则化评论家算法(DARC)/奖励机制/动态避障/优先经验回放机制/连续动作控制

Key words

deep reinforcement learning/manipulator/double actors regularized critics algorithm/reward and punishment mecha-nism/dynamic obstacle avoidance/prioritized experience replay/continuous action control

引用本文复制引用

基金项目

陕西省重点研发计划基金项目(2023-YBNY-212)

陕西省自然科学基金项目(2021JQ-179)

出版年

2024
计算机工程与设计
中国航天科工集团二院706所

计算机工程与设计

CSTPCD北大核心
影响因子:0.617
ISSN:1000-7024
参考文献量12
段落导航相关论文