首页|基于多人零和博弈的模块化机器人系统近似最优控制

基于多人零和博弈的模块化机器人系统近似最优控制

扫码查看
提出一种基于多人零和博弈的模块化机器人(Modular Robot Manipulators,MRMs)系统近似最优控制方法.建立了具有交联耦合(Interconnected Dynamic Couplings,IDC)的模块化机器人系统动力学模型.将机器人系统的控制律和IDC效应作为零和博弈的参与者,MRM系统的最优跟踪控制问题转化为多人零和博弈问题.根据自适应动态规划(Adaptive Dynamic Programming,ADP)算法,通过建立评判神经网络求解哈密顿-雅克比-埃塞克斯(Hamilton-Jacobi-Issacs,HJI)方程,推导出最优控制律.基于李雅普诺夫定理,证明了闭环机器人系统是渐近稳定的,最后通过实验验证了所提控制方法的有效性.
Approximate optimal control for Modular Robot Manipulators based on multiplayer zero-sum game
An approximate optimal control method for Modular Robot Manipulators(MRMs)systems based on multiplayer zero-sum game is proposed.A modular robot system dynamic model with Interconnected Dynamic Couplings(IDC)is developed.The control law and IDC effect are regarded as players in zero-sum game.The approximate optimal tracking control problem of the MRM system can be transformed into a multiplayer zero-sum game.According to the Adaptive Dynamic Programming(ADP)algorithm,the Hamilton-Jacobi-Issacs(HJI)equation can be solved by establishing critic neural network and then the approximate optimal control policy can be derived.Based on the Lyapunov theorem,the closed-loop robotic system is proved to be asymptotic stable.Finally,experiments are conducted to verify the effectiveness of the proposed method.

ADP(Adaptive Dynamic Programming)MRMs(Modular Robot Manipulators)multiplayer zero-sum gameoptimal control

董博、朱新野、马冰、安天骄

展开 >

长春工业大学电气与电子工程学院,吉林长春 130012

自适应动态规划 模块化机器人 多人零和博弈 最优控制

国家自然科学基金项目吉林省科技发展计划项目吉林省教育厅"十三五"科学计划项目

6217304720220201038OXJJKH20220689KJ

2024

长春工业大学学报
长春工业大学

长春工业大学学报

影响因子:0.282
ISSN:1674-1374
年,卷(期):2024.45(2)