Stochastic Linear Quadratic Optimal Tracking Control with Time-Delays Based on Adaptive Dynamic Programming

An adaptive dynamic programming (ADP) algorithm is proposed for a class of model-free stochastic linear quadratic (SLQ) optimal tracking problems with time delay. First, an equivalent system of the original time-delay system is derived using a double causal coordinate transformation. A new augmented system consisting of the equivalent system and the command generator is constructed, and the stochastic algebraic equations of the augmented system are given. Second, to solve the SLQ optimal tracking control problem, the stochastic problem is transformed into a deterministic one. The ADP algorithm is then proposed, and its convergence analysis is given. To implement the ADP algorithm, three neural networks are designed to approximate the optimal cost function, the optimal control gain matrix, and the system model, respectively. Finally, the effectiveness of the algorithm is verified by a numerical example.
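As a rough illustration of the kind of deterministic recursion that a stochastic LQ problem reduces to, the sketch below runs value iteration on a stochastic algebraic equation. It is not the paper's algorithm: it is model-based, has no time delay, no command generator, and no neural networks. The system form x_{k+1} = A x_k + B u_k + (C x_k + D u_k) w_k with zero-mean, unit-variance white noise w_k, the matrix names A, B, C, D, Q, R, and the function name `sae_value_iteration` are all assumptions introduced for this example.

```python
import numpy as np


def sae_value_iteration(A, B, C, D, Q, R, iters=1000, tol=1e-10):
    """Value iteration for an assumed SLQ system with multiplicative noise.

    Assumed dynamics: x_{k+1} = A x_k + B u_k + (C x_k + D u_k) w_k,
    with w_k zero-mean, unit-variance white noise, and stage cost
    x'Qx + u'Ru. Taking expectations removes w_k, so the recursion on
    the value matrix P is deterministic.
    Returns (P, K) for the feedback law u_k = -K x_k.
    """
    n = A.shape[0]
    P = np.zeros((n, n))  # start from the zero value function
    for _ in range(iters):
        S = R + B.T @ P @ B + D.T @ P @ D   # quadratic term in u
        M = B.T @ P @ A + D.T @ P @ C       # cross term between u and x
        P_next = Q + A.T @ P @ A + C.T @ P @ C - M.T @ np.linalg.solve(S, M)
        converged = np.max(np.abs(P_next - P)) < tol
        P = P_next
        if converged:
            break
    # Gain computed from the final value matrix.
    S = R + B.T @ P @ B + D.T @ P @ D
    M = B.T @ P @ A + D.T @ P @ C
    K = np.linalg.solve(S, M)
    return P, K


if __name__ == "__main__":
    # Hypothetical scalar system, chosen only to exercise the recursion.
    A = np.array([[0.9]])
    B = np.array([[1.0]])
    C = np.array([[0.1]])
    D = np.array([[0.0]])
    Q = np.array([[1.0]])
    R = np.array([[1.0]])
    P, K = sae_value_iteration(A, B, C, D, Q, R)
    print("P =", P, "K =", K)
```

A model-free ADP scheme of the kind the abstract describes would replace the known matrices with quantities estimated from data, for example through the three approximating networks it mentions; the recursion above only shows the fixed point such a scheme would aim for under the stated assumptions.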

Stochastic linear systems; time-delay; adaptive dynamic programming; neural networks

Tan Xufeng (谭旭峰), Li Yuan (李媛), Liu Yang (刘洋)

School of Science, Shenyang University of Technology, Shenyang 110870

National Natural Science Foundation of China (Grant No. 62103289)

2024

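Journal of Systems Science and Mathematical Sciences (系统科学与数学)
Academy of Mathematics and Systems Science, Chinese Academy of Sciences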

Indexed in: CSTPCD; Peking University Core Journals (北大核心)
Impact factor: 0.425
ISSN: 1000-0577
Year, volume (issue): 2024, 44(1)
  • 23