首页|基于人工势场DDPG算法的移动机械臂协同避障轨迹规划

基于人工势场DDPG算法的移动机械臂协同避障轨迹规划

扫码查看
为了提高移动机械臂在狭窄通道和障碍物约束情况的避障轨迹规划能力,提出一种人工势场法(APF)和深度确定性策略梯度算法(DDPG)结合的改进算法(APF-DDPG).首先,对机械臂设计了 APF规划得到近似姿态,再将研究问题表示为马尔科夫决策过程,设计了状态空间、动作空间和奖惩函数,对规划过程进行阶段性分析处理,设计了一种引导机制来过渡各控制阶段,即避障阶段由DDPG主导训练,目标规划阶段由近似姿态引导DDPG训练,最终获得用于规划的策略模型.最后,建立并设计了固定和随机状态场景的仿真实验,验证了所提算法的有效性.实验结果表明,相较于传统DDPG算法,APF-DDPG算法能够以更高收敛效率训练得到具有更高效控制性能的策略模型.
Collaborative obstacle avoidance trajectory planning for mobile robotic arms based on artificial potential field DDPG algorithm
To improve the obstacle avoidance trajectory planning ability of mobile robotic arm in narrow channel and obstacle constraint situations,by combining Artificial Potential Field method(APF)and Deep Deterministic Policy Gradient algorithm(DDPG),an improved algorithm named APF-DDPG was proposed.The APF planning was de-signed for the robotic arm to get the approximate pose,and the research problem was represented as a Markov deci-sion process.The state space,action space and reward and punishment functions were designed,and the planning process was analyzed and processed in phases.A mechanism for guiding was designed to transition the various con-trol phases,which the obstacle avoidance phase of the training was dominated by DDPG,and the approximate pose dominated the goal planning phase to guide the DDPG for the training.Thus the strategy model for planning was ob-tained from the training.Finally,simulation experiments of fixed and random state scenarios were established and designed to verify the effectiveness of the proposed algorithm.The experimental results showed that APF-DDPG al-gorithm could be trained with higher convergence efficiency to obtain a policy model with more efficient control per-formance by comparing with the traditional DDPG algorithm.

mobile robotic armobstacle avoidance trajectory planningartificial potential fielddeep deterministic policy gradientguided training

李勇、张朝兴、柴燎宁

展开 >

重庆邮电大学工业物联网与网络化控制教育部重点实验室,重庆 400065

移动机械臂 避障轨迹规划 人工势场法 深度确定性策略梯度 引导训练

2024

计算机集成制造系统
中国兵器工业集团第210研究所

计算机集成制造系统

CSTPCD北大核心
影响因子:1.092
ISSN:1006-5911
年,卷(期):2024.30(12)