广东工业大学学报2024,Vol.41Issue(2) :129-138.DOI:10.12052/gdutxb.230010

转移概率部分未知的离散时间Markov跳变系统Nash微分博弈

Nash Differential Game for Discrete-time Markov Jump System with Partially Unknown Transition Probabilities

张成科 徐萌 杨璐
广东工业大学学报2024,Vol.41Issue(2) :129-138.DOI:10.12052/gdutxb.230010

转移概率部分未知的离散时间Markov跳变系统Nash微分博弈

Nash Differential Game for Discrete-time Markov Jump System with Partially Unknown Transition Probabilities

张成科 1徐萌 2杨璐3
扫码查看

作者信息

  • 1. 广东工业大学 经济学院,广东 广州 510520
  • 2. 广东工业大学 管理学院,广东 广州 510520
  • 3. 广东技术师范大学 管理学院,广东 广州 510180
  • 折叠

摘要

考虑到转移概率矩阵元素无法完全获悉,如何在转移概率部分未知的情境下研究离散时间Markov跳变系统Nash微分博弈是有待解决的问题之一,这一问题可以为转移概率部分未知的Markov跳变系统Nash微分博弈理论在管理问题上的应用提供理论支撑.基于此,本文首先研究单人博弈情形,即ε-次优控制问题,借助自由连接权矩阵和配方法,得到了ε-次优控制策略存在的充分性条件,并给出了成本函数上界的显式表达;然后延伸至双人博弈进行分析,得到了ε-次优Nash均衡策略存在的条件等价于求解双线性矩阵不等式和矩阵不等式的优化问题,并通过启发式算法求解优化问题得到ε-次优Nash均衡策略;最后通过数值算例证明了主要结论的有效性.

Abstract

It is noted that transition probability matrix elements cannot be fully known.How to study Nash differential game for discrete-time Markov jump system(MJS)under the condition of unknown transition probability is one of the problems to be solved.This problem can provide theoretical support for the application of Nash differential game theory in Markov jump systems with partial unknown transition probability to management problems.Based on it,the case of one-player game,which is called the ε-suboptimal control problem,is firstly studied.By using the free-connection weighting matrix and"complete square"method,the sufficient conditions for the existence of the ε-suboptimal cntrol strategy are obtained,and an explicit expression of the upper bound of the cost function is given.Then,the conditions for the existence of ε-suboptimal Nash equilibrium strategy are equivalent to solving the optimization problem,which satisfied the bilinear matrix inequalities(BMIs)and matrix inequalities.The heuristic algorithm is used to solve the optimization problem to obtain the ε-suboptimal Nash equilibrium strategies.Finally,the numerical examples are provided to demonstrate the validity of the main conclusions.

关键词

离散时间Markov跳变系统/ε-次优控制/ε-次优Nash均衡

Key words

discrete time Markov jump system/ε-suboptimal control/ε-suboptimal Nash equilibrium

引用本文复制引用

基金项目

国家自然科学基金(71571053)

国家社会科学基金后期资助暨优秀博士论文资助项目(21FJYB025)

广东省基础与应用基础研究基金(2023A1515012335)

出版年

2024
广东工业大学学报
广东工业大学

广东工业大学学报

影响因子:0.628
ISSN:1007-7162
参考文献量21
段落导航相关论文