Borel状态空间中平均零和随机博弈的新条件

New conditions for zero-sum stochastic games with average criteria in Borel space

郭先平 ¹廖景浩 ¹谭梓祺 ¹温馨²

扫码查看

作者信息

1. 中山大学数学学院,广州 510275
2. 中山大学管理学院,广州 510275
折叠

摘要

本文研究Borel状态空间的离散时间Markov平均博弈.对报酬函数可以无界的一般情形,本文用平均最优双不等式取代相应的Shapley方程,提出比现有的几何遍历性条件更弱的新条件.在此新的条件下,本文建立上述平均最优双不等式的可解性,并由此证明平均博弈的值和Nash均衡策略的存在性.进而,在较强的几何遍历性条件下,用本文的最优双不等式,证明Shapley方程的可解性.最后,用电力系统与金融保险中的例子验证本文的条件,阐明本文的结果.

Abstract

In this paper,we study the expected average criterion in discrete-time Markov games with Borel spaces.For the general case of unbounded reward functions,we first replace the corresponding Shapley equation for the average criterion with average-optimality two-inequalities.Then,by using the relative difference of the values of the discounted games,we give a new set of optimality conditions,which are weaker than the geometric ergodicity condition in the existing literature.Under these new conditions,we not only establish the solvability of the average-optimality two-inequalities but also show the existence of both the value and a Nash equilibrium of the game.Moreover,under the stronger geometric ergodicity condition,by the average-optimality two-inequalities,we also establish the solvability of the Shapley equation.Finally,we present two examples of renewable resources and financial insurance to verify the conditions and illustrate the results in this paper.

关键词

零和平均随机博弈/最优性条件/平均最优双不等式/Shapley方程/Nash均衡策略

Key words

zero-sum average stochastic game/optimality conditions/average-optimality two-inequalities/Shapley equation/Nash equilibrium

引用本文复制引用

出版年

2024

中国科学(数学)

中国科学院

中国科学(数学)

CSTPCDCSCD北大核心

影响因子：0.221

ISSN：1674-7216

段落导航