Research on key technologies of distributed training for Level2 market quotation factor mining
Level2 market quotation data is the new generation of real-time market data products from the Shanghai and Shenzhen Stock Exchanges. As an enhanced version of basic market data, it currently offers the highest information density and the largest volume of information of any market data in China, yet it remains the least thoroughly mined. The data is of significant value for identifying potential risks in the securities market, but existing research lacks risk measurement and analysis based on it. Moreover, the market-wide scale of Level2 market quotation data is large, and the deep learning models used to extract information from it are becoming increasingly complex. Although hardware computing power continues to improve, it still cannot resolve problems such as long training times and low training efficiency. Therefore, based on Level2 market quotation data for the CSI 300, this paper uses deep learning and other methods to mine high-frequency volatility factors and builds a high-frequency volatility prediction model based on TabNet and LightGBM. In addition, a distributed training algorithm based on parallel differential evolution, Parallel_DE, is proposed for parameter calculation during distributed model training, and its scene mapping scheme and overall process design are elaborated. Both contributions are verified on the proposed distributed training platform. The experimental results show that the high-frequency volatility prediction model predicts realized volatility with high precision and has certain advantages over other methods; the Parallel_DE algorithm effectively reduces the error of local parameters on the test set while retaining parameter diversity to a certain extent, enabling efficient distributed training of a deep learning model with excellent performance. This paper provides valuable technologies and methodologies for leveraging Level2 market quotation data in risk identification within the securities market.
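The abstract does not give the internals of Parallel_DE, so the following is only a minimal sketch of the classic differential evolution scheme (DE/rand/1/bin) that such an algorithm presumably builds on. It is not the authors' method. The function name `de_generation`, the toy objective `sphere`, and the settings `f=0.5` and `cr=0.9` are all hypothetical. In a distributed-training reading, each row of `pop` could stand for one worker's local parameter vector and the objective for its test-set error, but that mapping is an assumption made here for illustration.

```python
import numpy as np


def de_generation(pop, fitness, objective, rng, f=0.5, cr=0.9):
    """Run one generation of classic DE/rand/1/bin (minimisation).

    pop      : (n, d) array, one candidate parameter vector per row (n >= 4)
    fitness  : (n,) array of objective values for the rows of pop
    objective: callable mapping a (d,) vector to a float (hypothetical stand-in
               for an error measure such as test-set loss)
    """
    n, d = pop.shape
    new_pop = pop.copy()
    new_fit = fitness.copy()
    for i in range(n):
        # Pick three distinct individuals, none of them equal to i.
        candidates = [j for j in range(n) if j != i]
        a, b, c = rng.choice(candidates, size=3, replace=False)
        # Mutation: base vector plus a scaled difference of two others.
        mutant = pop[a] + f * (pop[b] - pop[c])
        # Binomial crossover; force at least one component from the mutant.
        mask = rng.random(d) < cr
        mask[rng.integers(d)] = True
        trial = np.where(mask, mutant, pop[i])
        # Greedy selection: keep the trial only if it is no worse.
        trial_fit = objective(trial)
        if trial_fit <= new_fit[i]:
            new_pop[i] = trial
            new_fit[i] = trial_fit
    return new_pop, new_fit


def sphere(x):
    """Toy objective (assumption): sum of squares, minimised at the origin."""
    return float(np.sum(x ** 2))


rng = np.random.default_rng(0)
pop = rng.uniform(-5.0, 5.0, size=(12, 4))
fit = np.array([sphere(p) for p in pop])
initial_best = fit.min()

for _ in range(50):
    pop, fit = de_generation(pop, fit, sphere, rng)

final_best = fit.min()
```

Because selection is greedy per individual, no member's error can increase from one generation to the next, while the population as a whole still holds several distinct parameter vectors. That is one plausible reading of "reducing error while retaining diversity", though the paper's actual mechanism may differ.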