中国科学(数学)2024,Vol.54Issue(12) :1963-1978.DOI:10.1360/SSM-2024-0055

Borel状态空间中平均零和随机博弈的新条件

New conditions for zero-sum stochastic games with average criteria in Borel space

郭先平 廖景浩 谭梓祺 温馨
中国科学(数学)2024,Vol.54Issue(12) :1963-1978.DOI:10.1360/SSM-2024-0055

Borel状态空间中平均零和随机博弈的新条件

New conditions for zero-sum stochastic games with average criteria in Borel space

郭先平 1廖景浩 1谭梓祺 1温馨2
扫码查看

作者信息

  • 1. 中山大学数学学院,广州 510275
  • 2. 中山大学管理学院,广州 510275
  • 折叠

摘要

本文研究Borel状态空间的离散时间Markov平均博弈.对报酬函数可以无界的一般情形,本文用平均最优双不等式取代相应的Shapley方程,提出比现有的几何遍历性条件更弱的新条件.在此新的条件下,本文建立上述平均最优双不等式的可解性,并由此证明平均博弈的值和Nash均衡策略的存在性.进而,在较强的几何遍历性条件下,用本文的最优双不等式,证明Shapley方程的可解性.最后,用电力系统与金融保险中的例子验证本文的条件,阐明本文的结果.

Abstract

In this paper,we study the expected average criterion in discrete-time Markov games with Borel spaces.For the general case of unbounded reward functions,we first replace the corresponding Shapley equation for the average criterion with average-optimality two-inequalities.Then,by using the relative difference of the values of the discounted games,we give a new set of optimality conditions,which are weaker than the geometric ergodicity condition in the existing literature.Under these new conditions,we not only establish the solvability of the average-optimality two-inequalities but also show the existence of both the value and a Nash equilibrium of the game.Moreover,under the stronger geometric ergodicity condition,by the average-optimality two-inequalities,we also establish the solvability of the Shapley equation.Finally,we present two examples of renewable resources and financial insurance to verify the conditions and illustrate the results in this paper.

关键词

零和平均随机博弈/最优性条件/平均最优双不等式/Shapley方程/Nash均衡策略

Key words

zero-sum average stochastic game/optimality conditions/average-optimality two-inequalities/Shapley equation/Nash equilibrium

引用本文复制引用

出版年

2024
中国科学(数学)
中国科学院

中国科学(数学)

CSTPCDCSCD北大核心
影响因子:0.221
ISSN:1674-7216
段落导航相关论文