首页|小型无人有缆遥控水下机器人智能控制方法

小型无人有缆遥控水下机器人智能控制方法

扫码查看
针对深度确定性策略梯度(DDPG)算法应用于无人有缆遥控水下机器人(ROV)运动控制时存在的坏样本影响学习稳定性、缺少环境探索能力以及学习时间长难收敛等问题,从神经网络结构、噪声引入和融合监督学习3个方面对DDPG算法进行改进,并提出了基于混合神经网络结构和参数噪声的监督式DDPG算法.仿真结果表明,监督式DDPG算法比常规DDPG算法和传统比例-积分-微分(PID)算法更加有效.
Intelligent Control Method of Small Unmanned Cabled Remote-controled Underwater Robot
When the depth deterministic strategy gradient(DDPG)algorithm is applied to the motion control of unmanned cabled remote-controled underwater robot,several new problems such as the bad samples affect the learning stability,lack the ability to explore the environment are happened,and the learning time is difficult to cover the teaching of the algorithm.Hence,the DDPG algorithm is improved from three aspects:neural network structure,noise introduction and fusion supervised learning,and a supervised DDPG control algorithm based on hybrid neural network structure and parameter noise is proposed.The simulation results show that the improved DDPG algorithm is more effective than the conventional DDPG algorithm and the traditional PID algorithm.

depth deterministic strategy gradient(DDPG)algorithmhybrid neural networkparametric noisesupervised learningunmanned cabled remote-controled underwater robotmotion control

黄兆军、曾明如

展开 >

珠海城市职业技术学院机电工程学院,广东珠海 519090

南昌大学信息工程学院,南昌 330031

深度确定性策略梯度算法 混合神经网络 参数噪声 监督学习 无人有缆遥控水下机器人 运动控制

2023年广东省普通高校特色创新项目

2023KTSCX330

2024

实验室研究与探索
上海交通大学

实验室研究与探索

CSTPCD北大核心
影响因子:1.69
ISSN:1006-7167
年,卷(期):2024.43(7)