基于深度强化学习的高铁客票动态定价算法

扫码查看

原文链接

国家科技期刊平台
NETL
NSTL
万方数据
维普

中文摘要：为了解决需求函数未知情况下的高铁客票动态定价问题.以最大化单列车期望收益为目标构建Markov多阶段决策模型并设计DQN(Deep Q Net)强化学习框架寻找动态定价最优策略.算法以当日收益为奖励,通过神经网络来逼近所有状态-动作组合的期望最优收益.为验证算法性能,基于市场动态和旅客行为,开发高铁客运需求模拟系统并进行仿真实验.实验结果表明,智能体动态定价策略可以在不同需求水平下灵活调整价格,其性能接近理论上界并且显著优于对比策略.

外文标题：DYNAMIC PRICING ALGORITHM FOR HIGH-SPEED RAIL TICKETS BASED ON DEEP REINFORCEMENT LEARNING

外文摘要：This paper aims to solve the dynamic pricing problem of high-speed rail tickets under the unknown demand function.To maximize the expected return of a single train,we constructed a Markov multi-stage decision model and designed a DQN(Deep Q Net)reinforcement learning framework to find the optimal strategy for dynamic pricing.The algorithm used the day's income as the reward,and approximated the expected optimal return of all state-action combinations using a neural network.A high-speed rail passenger transport demand simulator was developed based on the market dynamics and passenger behavior for verifying the performance of the algorithm.The experimental results show that the agent dynamic pricing strategy can adjust the price flexibly under different demand levels,and its performance is close to the theoretical upper bound and better than the comparison strategy significantly.

外文关键词：

Revenue managementHigh-speed rail tickets pricingDynamic pricingDynamic planningRein-forcement learningEnvironment simulation algorithm

作者：

毕文杰、陈功

展开 >

作者单位：

中南大学商学院湖南长沙 410083

关键词：

收益管理高铁客票定价动态定价动态规划强化学习环境模拟算法

基金：

国家自然科学基金项目国家自然科学基金项目

项目编号：

7187123191646115

出版年：

2024

DOI：

10.3969/j.issn.1000-386x.2024.04.035

计算机应用与软件

上海市计算技术研究所上海计算机软件技术开发中心

计算机应用与软件

CSTPCD北大核心

影响因子：0.615

ISSN：1000-386X

年,卷(期)：2024.41(4)

参考文献量24