衰减高斯噪声DDPG算法的机械臂轨迹规划

Trajectory planning of robotic arm based on the gaussian decaying noise DDPG algorithm

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：针对农业采摘机械臂的DDPG算法轨迹规任务中,调查了因高斯噪声标准差取值不当导致的强化学习训练失败问题,提出一种衰减正态噪声的DDPG算法,使高斯标准差σ随训练回合数增加而减小;利用Mujoco物理引擎进行多次仿真训练,验证衰减正态噪声相较于传统正态噪声在轨迹规划任务中的优势.结果表明,改进后的算法在完成采摘机械臂的轨迹规划任务时更为有效,成功解决了存在的问题.

外文摘要：The issue of DDPG algorithm training failure caused by inappropriate values of Gaussian noise standard deviation in the trajectory planning task of agricultural picking robotic arms was investigated.To address this problem,a decaying normal noise DDPG algorithm was proposed,where the Gaussian standard deviation σ decreases as the number of training episodes increases.Multiple simulation training sessions using the Mujoco physics engine were conducted to verify the advantages of decaying normal noise over traditional normal noise in trajectory planning tasks.The results showed that the improved algorithm was more effective in completing the trajectory planning task of the picking robotic arm,successfully solving the problem.

外文关键词：

reinforcement learningDDPG algorithmgaussian noiserobotic armtrajectory planning

作者：

周雨溪、赵慧、韩晓峰

展开 >

作者单位：

武汉科技大学湖北省机械传动与制造工程重点实验室,湖北武汉 430081

武汉科技大学机器人与智能系统研究院,湖北武汉 430081

关键词：

强化学习 DDPG算法正态噪声机械臂轨迹规划

出版年：

2024

DOI：

10.3969/j.issn.1673-3142.2024.10.019

农业装备与车辆工程

山东省农业机械科学研究所山东农机学会

农业装备与车辆工程

影响因子：0.279

ISSN：1673-3142

年,卷(期)：2024.62(10)