股指预测的创新深度学习策略:Transformer模型与GRU融合及其变体的效能探究

Innovative Deep Learning Strategies for Stock Index Prediction:Exploring the Efficacy of Transformer Model and GRU Integration and Their Variants

扫码查看

原文链接

万方数据

中文摘要：随着金融市场的不断发展和全球经济的变化,准确预测股市指数成为投资者和决策者关注的焦点之一.本文旨在探讨深度学习神经网络中的Transformer模型及其注意力机制在金融指数预测中的应用.通过摒弃常规的控制变量设计,转而采用基于历史股指数据的高阶自回归模型,本文创新性地提出了三种 Transformer 模型的变体:Multi-attention Transformer、GRU Transformer、Attention-Free Transformer,并对它们在单步迭代预测和多步一次预测两种方式下的表现进行比较.实证分析基于2000年1月1日至2024年3月11日的上证指数日度数据,通过将数据扩充和标准化,利用Python进行处理.结果显示:GRU Transformer模型结合单步迭代预测在测试集上的平均均方误差最低,为0.00041,且在参数数量和运行时间上均表现优异,表明其在预测准确性、参数效率和运行时间方面具有显著优势.本文的创新点包括:采用基于历史时间序列数据的高阶自回归模型简化模型结构,保持预测准确性;提出并验证了三种Trans-former模型变体在金融时间序列预测中的有效性;比较了单步迭代预测和多步一次预测两种方式的组合效果.本文研究为金融市场的分析和预测提供了新的视角和方法,未来研究可以进一步验证模型的有效性并探索其他潜在的改进策略.

外文摘要：With the continuous development of financial markets and changes in the global economy,accurately predicting stock market indices has become a key focus for investors and decision-makers.This study aims to explore the application of the Trans-former model and its attention mechanism in the prediction of financial indices within deep learning neural networks.By discard-ing conventional control variable designs and adopting a high-order autoregressive model based on historical stock index data,this paper innovatively proposes three Transformer model variants:Multi-attention Transformer,GRU Transformer,and Attention-Free Transformer,and compares their performance under both single-step iterative prediction and multi-step prediction methods.Empirical analysis is based on the daily data of the Shanghai Stock Exchange Index from January 1,2000,to March 11,2024.The data was expanded and standardized using Python.The results show that the GRU Transformer model combined with single-step iterative prediction has the lowest mean squared error(MSE)on the test set,at 0.00041,and performs excellently in terms of parameter efficiency and runtime,indicating significant advantages in prediction accuracy,parameter efficiency,and runtime.The innovations of this paper include simplifying the model structure while maintaining prediction accuracy by using a high-order au-toregressive model based on historical time series data;proposing and validating the effectiveness of three Transformer model variants in financial time series prediction;and comparing the effects of combining single-step iterative prediction and multi-step prediction methods.This study provides new perspectives and methods for financial market analysis and prediction,and future re-search can further validate the model's effectiveness and explore other potential improvement strategies.

外文关键词：

Higher-order Autoregressive ModelTransformer ModelAttention MechanismFinancial Index Prediction

作者：

肖哲坤

展开 >

作者单位：

北京大学汇丰商学院

关键词：

高阶自回归模型 Transformer模型注意力机制金融指数预测

出版年：

2024

工程经济

中国建设工程造价管理协会

工程经济

CHSSCD

影响因子：0.481

ISSN：1672-2442

年,卷(期)：2024.34(8)