Journal of Systems Science and Mathematical Sciences, 2024, Vol. 44, Issue 1: 17-30. DOI: 10.12341/jssms23366

Stochastic Linear Quadratic Optimal Tracking Control with Time-Delays Based on Adaptive Dynamic Programming

TAN Xufeng (谭旭峰)¹, LI Yuan (李媛)¹, LIU Yang (刘洋)¹

Author information

  • 1. School of Science, Shenyang University of Technology, Shenyang 110870, China

Abstract

An adaptive dynamic programming (ADP) algorithm is proposed for a class of stochastic linear quadratic (SLQ) optimal tracking control problems with time delay and an unknown system model. First, an equivalent system of the original time-delay system is derived using the double causal coordinate transformation. A new augmented system consisting of the equivalent system and the command generator is constructed, and the stochastic algebraic equations of the augmented system are given. Second, to solve the SLQ optimal tracking control problem, the stochastic problem is transformed into a deterministic one. The ADP algorithm is then proposed and its convergence analysis is given. To implement the ADP algorithm, three neural networks are designed, which approximate the optimal cost function, the optimal control gain matrix, and the system model, respectively. Finally, the effectiveness of the algorithm is verified by a numerical example.
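The augmented-system idea in the abstract can be illustrated with a small numerical sketch. It is not the authors' algorithm: the abstract does not give the system equations, so everything below is an assumption made for illustration. The sketch assumes a discrete-time plant with multiplicative noise, x(k+1) = A x(k) + B u(k) + (C x(k) + D u(k)) w(k) with zero-mean, unit-variance w(k), a command generator r(k+1) = F r(k) of the same dimension as x, a discount factor gamma, no delay, and a known model (the paper treats the delayed, model-unknown case with neural networks). Taking expectations turns the stochastic Bellman equation into a deterministic matrix recursion on P, which is iterated from P = 0.

```python
import numpy as np


def build_augmented(A, B, C, D, F, Q):
    """Stack plant state x and reference r into X = [x; r] (assumed layout)."""
    n, m = B.shape
    Z = np.zeros((n, n))
    T = np.block([[A, Z], [Z, F]])          # drift of the augmented state
    B1 = np.vstack([B, np.zeros((n, m))])   # input acts on the plant only
    C1 = np.block([[C, Z], [Z, Z]])         # noise acts on the plant only
    D1 = np.vstack([D, np.zeros((n, m))])
    E = np.hstack([np.eye(n), -np.eye(n)])  # tracking error e = x - r = E X
    Q1 = E.T @ Q @ E                        # penalize e'Qe in augmented form
    return T, B1, C1, D1, Q1


def bellman_update(P, T, B1, C1, D1, Q1, R, gamma):
    """One value-iteration step for V(X) = X'PX after taking expectations."""
    S = R + gamma * (B1.T @ P @ B1 + D1.T @ P @ D1)
    M = B1.T @ P @ T + D1.T @ P @ C1
    K = gamma * np.linalg.solve(S, M)       # greedy gain, u = -K X
    P_next = Q1 + gamma * (T.T @ P @ T + C1.T @ P @ C1) - gamma * M.T @ K
    return P_next, K


def value_iteration(T, B1, C1, D1, Q1, R, gamma, tol=1e-10, max_iter=5000):
    """Iterate the deterministic recursion from P = 0 until it stops changing."""
    P = np.zeros_like(Q1)
    for _ in range(max_iter):
        P_next, _ = bellman_update(P, T, B1, C1, D1, Q1, R, gamma)
        done = np.max(np.abs(P_next - P)) < tol
        P = P_next
        if done:
            break
    _, K = bellman_update(P, T, B1, C1, D1, Q1, R, gamma)
    return P, K


# Hypothetical scalar example: track a constant reference (F = 1).
A = np.array([[0.9]])
B = np.array([[1.0]])
C = np.array([[0.1]])
D = np.array([[0.0]])
F = np.array([[1.0]])
Q = np.array([[1.0]])
R = np.array([[1.0]])
gamma = 0.8

T, B1, C1, D1, Q1 = build_augmented(A, B, C, D, F, Q)
P, K = value_iteration(T, B1, C1, D1, Q1, R, gamma)
print("P =", P)
print("K =", K)
```

The discount factor is included because the constant reference is not asymptotically stable, so an undiscounted cost would be unbounded in this toy setup. In a model-free setting, the matrices T, B1, C1, and D1 used in `bellman_update` would not be available and would have to be replaced by learned approximations.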

Keywords

Stochastic linear systems / time-delay / adaptive dynamic programming / neural networks

Funding

National Natural Science Foundation of China (62103289)

Publication year

2024
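Journal of Systems Science and Mathematical Sciences
Academy of Mathematics and Systems Science, Chinese Academy of Sciences

Indexed in: CSTPCD, PKU Core (北大核心)
Impact factor: 0.425
ISSN: 1000-0577
Number of references: 23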