小型无人有缆遥控水下机器人智能控制方法

Intelligent Control Method of Small Unmanned Cabled Remote-controled Underwater Robot

扫码查看

原文链接

维普
万方数据

中文摘要：针对深度确定性策略梯度(DDPG)算法应用于无人有缆遥控水下机器人(ROV)运动控制时存在的坏样本影响学习稳定性、缺少环境探索能力以及学习时间长难收敛等问题,从神经网络结构、噪声引入和融合监督学习3个方面对DDPG算法进行改进,并提出了基于混合神经网络结构和参数噪声的监督式DDPG算法.仿真结果表明,监督式DDPG算法比常规DDPG算法和传统比例-积分-微分(PID)算法更加有效.

外文摘要：When the depth deterministic strategy gradient(DDPG)algorithm is applied to the motion control of unmanned cabled remote-controled underwater robot,several new problems such as the bad samples affect the learning stability,lack the ability to explore the environment are happened,and the learning time is difficult to cover the teaching of the algorithm.Hence,the DDPG algorithm is improved from three aspects:neural network structure,noise introduction and fusion supervised learning,and a supervised DDPG control algorithm based on hybrid neural network structure and parameter noise is proposed.The simulation results show that the improved DDPG algorithm is more effective than the conventional DDPG algorithm and the traditional PID algorithm.

外文关键词：

depth deterministic strategy gradient(DDPG)algorithmhybrid neural networkparametric noisesupervised learningunmanned cabled remote-controled underwater robotmotion control

作者：

黄兆军、曾明如

展开 >

作者单位：

珠海城市职业技术学院机电工程学院,广东珠海 519090

南昌大学信息工程学院,南昌 330031

关键词：

深度确定性策略梯度算法混合神经网络参数噪声监督学习无人有缆遥控水下机器人运动控制

基金：

2023年广东省普通高校特色创新项目

项目编号：

2023KTSCX330

出版年：

2024

DOI：

10.19927/j.cnki.syyt.2024.07.007

实验室研究与探索

上海交通大学

实验室研究与探索

CSTPCD北大核心

影响因子：1.69

ISSN：1006-7167

年,卷(期)：2024.43(7)