To address the trade-off between the temporal and spatial resolution of images acquired by remote sensing satellites, a remote sensing image spatiotemporal fusion method that combines progressive super-resolution with an attention mechanism is proposed. The method consists of a progressive super-resolution module and a feature fusion module. The former bridges the 16-fold spatial resolution gap between MODIS and Landsat images through repeated 2-fold upsampling, while the latter fuses super-resolution features at five different scales. Experiments comparing the proposed method with two other spatiotemporal fusion algorithms on the CIA dataset show that it incurs less spectral loss and reconstructs detail more accurately.
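The arithmetic behind the two modules can be illustrated with a minimal sketch. It is not the proposed network: nearest-neighbour repetition stands in for the learned 2-fold upsampling layers, and a softmax-weighted average stands in for the attention-based fusion. The function names (`upsample2x`, `progressive_scales`, `fuse_scales`) are hypothetical, as is the reading that the five scales are the 1x, 2x, 4x, 8x and 16x feature maps produced by four 2-fold stages.

```python
import numpy as np


def upsample2x(x):
    """Nearest-neighbour 2x upsampling of a 2-D array (placeholder for a learned layer)."""
    return np.repeat(np.repeat(x, 2, axis=0), 2, axis=1)


def progressive_scales(coarse, stages=4):
    """Apply `stages` successive 2x upsamplings and keep every intermediate result.

    With stages=4 the final factor is 2**4 = 16 and five scales are returned
    (assumed here to correspond to the five scales named in the abstract).
    """
    scales = [coarse]
    for _ in range(stages):
        scales.append(upsample2x(scales[-1]))
    return scales


def fuse_scales(scales, scores):
    """Resample every scale to the finest grid and blend with softmax weights.

    `scores` plays the role of attention logits; in a real model they would be
    predicted from the features rather than supplied by hand.
    """
    target = scales[-1].shape[0]
    weights = np.exp(scores - np.max(scores))
    weights /= weights.sum()
    fused = np.zeros_like(scales[-1], dtype=float)
    for w, s in zip(weights, scales):
        factor = target // s.shape[0]
        fused += w * np.repeat(np.repeat(s, factor, axis=0), factor, axis=1)
    return fused


# Toy 4x4 "coarse" patch standing in for a MODIS-resolution feature map.
coarse = np.arange(16, dtype=float).reshape(4, 4)
scales = progressive_scales(coarse)
fused = fuse_scales(scales, np.zeros(len(scales)))
print([s.shape for s in scales])
print(fused.shape)
```

Four 2-fold stages give the 16-fold factor, and keeping the input alongside the four outputs yields five feature maps to fuse. Splitting the upsampling into small steps is what makes intermediate scales available to the fusion stage at all; a single 16-fold step would produce only one.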