中国科学:信息科学(英文版)2024,Vol.67Issue(8) :143-167.DOI:10.1007/s11432-022-3859-8

SeeMore:a spatiotemporal predictive model with bidirectional distillation and level-specific meta-adaptation

Yuqing MA Wei LIU Yajun GAO Yang YUAN Shihao BAI Haotong QIN Xianglong LIU
中国科学:信息科学(英文版)2024,Vol.67Issue(8) :143-167.DOI:10.1007/s11432-022-3859-8

SeeMore:a spatiotemporal predictive model with bidirectional distillation and level-specific meta-adaptation

Yuqing MA 1Wei LIU 1Yajun GAO 1Yang YUAN 1Shihao BAI 1Haotong QIN 1Xianglong LIU1
扫码查看

作者信息

  • 1. State Key Lab of Software Development Environment,Beihang University,Beijing 100191,China
  • 折叠

Abstract

Predicting future frames using historical spatiotemporal data sequences is challenging and criti-cal,and it is receiving a lot of attention these days from academic and industrial scholars.Most spatiotempo-ral predictive algorithms ignore the valuable backward reasoning ability and the disparate learning complex-ities among different layers and hence,cannot build good long-term dependencies and spatial correlations,resulting in suboptimal solutions.To address the aforementioned issues,we propose a two-stage coarse-to-fine spatiotemporal predictive model with bidirectional distillation and level-specific meta-adaptation(SeeMore)in this paper,which includes a bidirectional distillation network(BDN)and a level-specific meta-adapter(LMA),to gain bidirectional multilevel reasoning.In the first stage,BDN concentrates on bidirectional dynamics modeling and coarsely constructs spatial correlations of different layers,while LMA is introduced in the second fine-tuning stage to refine the multilevel spatial correlations from a meta-learning perspective.In particular,BDN mimics the forward and backward reasoning abilities of humans in a distillation manner,which aids in the development of long-term dependencies.The LMA views learning of different layers as disparate but related tasks and guides the transfer of learning experiences among these tasks through learning complexities.Thus,each layer could be closer to its solutions and could extract more informative spatial cor-relations.By capturing the enhanced short-term spatial correlations and long-term temporal dependencies,the proposed model could extract adequate knowledge from sequential historical observations and accurately predict future frames whose backtracking preconditions are consistent with the historical sequence.Our work is general and robust enough to be integrated into most spatiotemporal predictive models without requiring additional computation or memory cost during inference.Extensive experiments on four widely used pre-dictive learning benchmarks validated the proposed model's effectiveness in comparison to state-of-the-art approaches(e.g.,10.6%improvement of Mean Squared Error on the Moving MNIST dataset).

Key words

spatiotemporal predictive learning/knowledge transfer/bidirectional distillation network/level-specific meta-adapter/coarse-to-fine training

引用本文复制引用

基金项目

National Natural Science Foundation of China(62206010)

National Natural Science Foundation of China(62022009)

出版年

2024
中国科学:信息科学(英文版)
中国科学院

中国科学:信息科学(英文版)

CSTPCDEI
影响因子:0.715
ISSN:1674-733X
参考文献量1
段落导航相关论文