Multi-Scale Attention Recurrent Network with Multi-order Taylor Differential Knowledge for Deep Spatiotemporal Sequence Prediction
Deep spatiotemporal sequence prediction methods that incorporate a priori physical knowledge are commonly characterized by the utilization of Partial Differential Equations(PDE)for modeling.However,two main issues are concerned:(1)the limited precision in approximations with PDEs;and(2)the inability to efficiently capture spatiotemporal features at multiple spatial scales as well as the edge spatial information of the spatiotemporal sequences in the recurrent network.To address these challenges,one Taylor Differential Incorporated Convolutional Recurrent Neural Network(TDI-CRNN)is proposed in this paper.Firstly,in order to enhance the approximation accuracy of higher-order partial differential equations and to alleviate the limitations of PDE applications,one physical module with multi-order Taylor approximation is designed.The module is firstly used for the differential approximation of the input sequence by means of the Taylor expansion,and then couples the differential convolution layers with different orders via differential coefficients,and dynamically adjusts the truncation order and the number of differential terms of the Taylor expansions.Secondly,to capture the multiple spatial scale features of the hidden states in the recurrent network and to better capture the edge spatial information of the spatiotemporal sequences,one Multi-Scale Attention Recurrent Module(MSARM)is devised.Multi-scale convolution and spatial attention mechanisms are utilized in the convolution layer of the Multi-scale Convolution Spatial Attention UNet(MCSA-UNet),aiming to focus on local spatial regions within spatiotemporal sequences.Extensive experiments are conducted on the Moving MNIST,KTH,and CIKM datasets.The Mean Squared Error(MSE)on the Moving MNIST dataset dropped to 42.7,while the Structural Similarity Index Measure(SSIM)increased to 0.912.The SSIM and Peak Signal-to-Noise Ratio(PSNR)on the KTH dataset increased to 0.882 and 29.03,respectively.The Correct Skill Index(CSI)on the real weather radar echo CIKM dataset increased to 0.515.The final visualization and quantitative prediction results verify the rationality and effectiveness of the TDI-CRNN model.
Spatiotemporal sequences predictionLong Short Term Memory(LSTM)Knowledge-guidedPartial differential equationTaylor expansion