防务技术2024,Vol.34Issue(4) :19-28.DOI:10.1016/j.dt.2023.05.016

Reinforcement learning based adaptive control for uncertain mechanical systems with asymptotic tracking

Xiang-long Liang Zhi-kai Yao Yao-wen Ge Jian-yong Yao
防务技术2024,Vol.34Issue(4) :19-28.DOI:10.1016/j.dt.2023.05.016

Reinforcement learning based adaptive control for uncertain mechanical systems with asymptotic tracking

Xiang-long Liang 1Zhi-kai Yao 2Yao-wen Ge 1Jian-yong Yao1
扫码查看

作者信息

  • 1. School of Mechanical Engineering,Nanjing University of Science and Technology,Nanjing 210094,China
  • 2. College of Automation & College of Artificial Intelligence,Nanjing University of Post and Telecommunication,Nanjing,210023,China
  • 折叠

Abstract

This paper mainly focuses on the development of a learning-based controller for a class of uncertain mechanical systems modeled by the Euler-Lagrange formulation.The considered system can depict the behavior of a large class of engineering systems,such as vehicular systems,robot manipulators and satellites.All these systems are often characterized by highly nonlinear characteristics,heavy modeling uncertainties and unknown perturbations,therefore,accurate-model-based nonlinear control ap-proaches become unavailable.Motivated by the challenge,a reinforcement learning(RL)adaptive control methodology based on the actor-critic framework is investigated to compensate the uncertain me-chanical dynamics.The approximation inaccuracies caused by RL and the exogenous unknown distur-bances are circumvented via a continuous robust integral of the sign of the error(RISE)control approach.Different from a classical RISE control law,a tanh(·)function is utilized instead of a sign(·)function to acquire a more smooth control signal.The developed controller requires very little prior knowledge of the dynamic model,is robust to unknown dynamics and exogenous disturbances,and can achieve asymptotic output tracking.Eventually,co-simulations through ADAMS and MATLAB/Simulink on a three degrees-of-freedom(3-DOF)manipulator and experiments on a real-time electromechanical servo sys-tem are performed to verify the performance of the proposed approach.

Key words

Adaptive control/Reinforcement learning/Uncertain mechanical systems/Asymptotic tracking

引用本文复制引用

基金项目

国家重点研发计划(2021YFB2011300)

国家自然科学基金(52075262)

出版年

2024
防务技术
中国兵工学会

防务技术

CSTPCD
影响因子:0.358
ISSN:2214-9147
段落导航相关论文