多期贝叶斯强化学习鲁棒投资组合选择模型

Multi-stage Bayesian Reinforcement Learning Robust Portfolio Selection Model

扫码查看

原文链接

维普
万方数据

中文摘要：在传统多期分布式鲁棒投资组合选择模型中,不确定集合的估计是一个具有挑战性的难题.使用贝叶斯强化学习方法来动态更新不确定集合中的一、二阶矩等模型参数,进而研究贝叶斯强化学习框架下均值-最坏鲁棒CVaR模型的求解问题.通过结合动态规划和渐进对冲算法,设计了两层分解求解框架.下层通过求解一系列二阶锥规划来得到给定模型参数下子问题的最优策略,上层使用贝叶斯公式得到可实施的非预期投资策略.基于美国股票市场的实证结果表明:多期鲁棒强化学习投资组合选择模型相较传统模型具有更好的样本外投资表现.

外文摘要：The estimation of uncertainty sets in traditional multi-stage distributionally robust portfolio selection models is a challenging problem.This paper applys the Bayesian reinforce-ment learning technique to dynamically update the first two order moments in the uncertainty sets of a multi-stage distributionally robust model.We study the mean-worst case robust CVaR model in the Bayesian reinforcement learning framework.We propose a two-level decomposition solution framework by combining dynamic programming techniques and the progressive hedg-ing algorithm.The lower level finds optimal policies of sub-models with given model parameters by solving a series of second-order cone programming problems.While the upper level finds an implementable policy satisfying non-anticipation constraints by using Bayes'law.Numerical results in the US stock market illustrate the superior out-of-sample investment performance of the multi-stage Bayesian reinforcement learning robust portfolio selection model.

外文关键词：

Bayesian reinforcement learningrobust risk measureportfolio selectionsecond-order cone programming

作者：

李柔佳、段启宏、冯卓航、刘嘉

展开 >

作者单位：

西安交通大学数学与统计学院,西安 710049

关键词：

贝叶斯强化学习鲁棒风险度量投资组合二阶锥规划

基金：

国家重点研发计划国家自然科学基金国家自然科学基金

项目编号：

2022YFA10040001199102312371324

出版年：

2024

DOI：

10.3969/j.issn.1005-3085.2024.02.003

工程数学学报

西安交通大学

工程数学学报

CSTPCD北大核心

影响因子：0.302

ISSN：1005-3085

年,卷(期)：2024.41(2)

参考文献量17