基于自适应动态规划的随机时滞线性二次型最优跟踪控制

扫码查看

原文链接

万方数据
维普

中文摘要：针对模型未知且带有时滞的随机线性二次型(SLQ)最优跟踪控制问题,提出了一种自适应动态规划(ADP)算法.首先,利用双因果坐标变换导出原时滞系统的等效系统,构造一个新的由等效系统和命令生成器组成的增广系统,并给出该增广系统的随机代数方程.其次,为了解决随机线性二次最优跟踪控制问题,将随机问题转化为确定性问题.然后提出ADP算法,并给出该算法的收敛性分析.为了实现ADP算法,设计了三种神经网络,分别近似最优性能指标函数,最优控制增益矩阵和系统模型.最后,通过一个数值算例验证算法的有效性.

外文标题：Stochastic Linear Quadratic Optimal Tracking Control with Time-Delays Based on Adaptive Dynamic Programming

外文摘要：An adaptive dynamic programming(ADP)algorithm is proposed for a class of model-free stochastic linear quadratic(SLQ)optimal tracking problem with time-delay.Firstly,the equivalent system of the original time-delay system is de-rived using the double causal coordinate transformation.A new augmented system consisting of the equivalent system and the command generator is constructed,and then the stochastic algebraic equations of the augmented system are given.Secondly,in order to solve the SLQ tracking control problem,the stochastic problem is trans-formed into deterministic problem.Then the ADP algorithm is proposed and its convergence analysis is given.For the purpose of realizing the ADP algorithm,three neural networks are designed,which approximate the optimal cost function,the opti-mal control gain matrix and the system model respectively.Finally,the effectiveness of the algorithm is verified by a numeric example.

外文关键词：

Stochastic linear systemstime-delayadaptive dynamic programmingneural networks

作者：

谭旭峰、李媛、刘洋

展开 >

作者单位：

沈阳工业大学理学院,沈阳 110870

关键词：

随机线性系统时滞自适应动态规划神经网络

基金：

国家自然科学基金

项目编号：

62103289

出版年：

2024

DOI：

10.12341/jssms23366

系统科学与数学

中国科学院数学与系统科学研究院

系统科学与数学

CSTPCD北大核心

影响因子：0.425

ISSN：1000-0577

年,卷(期)：2024.44(1)

参考文献量23