首页|多期贝叶斯强化学习鲁棒投资组合选择模型

多期贝叶斯强化学习鲁棒投资组合选择模型

扫码查看
在传统多期分布式鲁棒投资组合选择模型中,不确定集合的估计是一个具有挑战性的难题.使用贝叶斯强化学习方法来动态更新不确定集合中的一、二阶矩等模型参数,进而研究贝叶斯强化学习框架下均值-最坏鲁棒CVaR模型的求解问题.通过结合动态规划和渐进对冲算法,设计了两层分解求解框架.下层通过求解一系列二阶锥规划来得到给定模型参数下子问题的最优策略,上层使用贝叶斯公式得到可实施的非预期投资策略.基于美国股票市场的实证结果表明:多期鲁棒强化学习投资组合选择模型相较传统模型具有更好的样本外投资表现.
Multi-stage Bayesian Reinforcement Learning Robust Portfolio Selection Model
The estimation of uncertainty sets in traditional multi-stage distributionally robust portfolio selection models is a challenging problem.This paper applys the Bayesian reinforce-ment learning technique to dynamically update the first two order moments in the uncertainty sets of a multi-stage distributionally robust model.We study the mean-worst case robust CVaR model in the Bayesian reinforcement learning framework.We propose a two-level decomposition solution framework by combining dynamic programming techniques and the progressive hedg-ing algorithm.The lower level finds optimal policies of sub-models with given model parameters by solving a series of second-order cone programming problems.While the upper level finds an implementable policy satisfying non-anticipation constraints by using Bayes'law.Numerical results in the US stock market illustrate the superior out-of-sample investment performance of the multi-stage Bayesian reinforcement learning robust portfolio selection model.

Bayesian reinforcement learningrobust risk measureportfolio selectionsecond-order cone programming

李柔佳、段启宏、冯卓航、刘嘉

展开 >

西安交通大学数学与统计学院,西安 710049

贝叶斯强化学习 鲁棒风险度量 投资组合 二阶锥规划

国家重点研发计划国家自然科学基金国家自然科学基金

2022YFA10040001199102312371324

2024

工程数学学报
西安交通大学

工程数学学报

CSTPCD北大核心
影响因子:0.302
ISSN:1005-3085
年,卷(期):2024.41(2)
  • 17