首页|衰减高斯噪声DDPG算法的机械臂轨迹规划

衰减高斯噪声DDPG算法的机械臂轨迹规划

扫码查看
针对农业采摘机械臂的DDPG算法轨迹规任务中,调查了因高斯噪声标准差取值不当导致的强化学习训练失败问题,提出一种衰减正态噪声的DDPG算法,使高斯标准差σ随训练回合数增加而减小;利用Mujoco物理引擎进行多次仿真训练,验证衰减正态噪声相较于传统正态噪声在轨迹规划任务中的优势.结果表明,改进后的算法在完成采摘机械臂的轨迹规划任务时更为有效,成功解决了存在的问题.
Trajectory planning of robotic arm based on the gaussian decaying noise DDPG algorithm
The issue of DDPG algorithm training failure caused by inappropriate values of Gaussian noise standard deviation in the trajectory planning task of agricultural picking robotic arms was investigated.To address this problem,a decaying normal noise DDPG algorithm was proposed,where the Gaussian standard deviation σ decreases as the number of training episodes increases.Multiple simulation training sessions using the Mujoco physics engine were conducted to verify the advantages of decaying normal noise over traditional normal noise in trajectory planning tasks.The results showed that the improved algorithm was more effective in completing the trajectory planning task of the picking robotic arm,successfully solving the problem.

reinforcement learningDDPG algorithmgaussian noiserobotic armtrajectory planning

周雨溪、赵慧、韩晓峰

展开 >

武汉科技大学 湖北省机械传动与制造工程重点实验室,湖北 武汉 430081

武汉科技大学 机器人与智能系统研究院,湖北 武汉 430081

强化学习 DDPG算法 正态噪声 机械臂 轨迹规划

2024

农业装备与车辆工程
山东省农业机械科学研究所 山东农机学会

农业装备与车辆工程

影响因子:0.279
ISSN:1673-3142
年,卷(期):2024.62(10)