中国科学:技术科学(英文版)2024,Vol.67Issue(1) :105-120.DOI:10.1007/s11431-023-2502-9

Causal reasoning in typical computer vision tasks

ZHANG KeXuan SUN QiYu ZHAO ChaoQiang TANG Yang
中国科学:技术科学(英文版)2024,Vol.67Issue(1) :105-120.DOI:10.1007/s11431-023-2502-9

Causal reasoning in typical computer vision tasks

ZHANG KeXuan 1SUN QiYu 1ZHAO ChaoQiang 2TANG Yang1
扫码查看

作者信息

  • 1. Key Laboratory of Advanced Control and Optimization for Chemical Process,Ministry of Education,East China University of Science and Technology,Shanghai 200237,China
  • 2. National Key Laboratory of Air based Information Perception and Fusion,Luoyang 471000,China;Luoyang Institute of Electro Optical Equipment of Avic,Luoyang 471000,China
  • 折叠

Abstract

Deep learning has revolutionized the field of artificial intelligence.Based on the statistical correlations uncovered by deep learning-based methods,computer vision tasks,such as autonomous driving and robotics,are growing rapidly.Despite being the basis of deep learning,such correlation strongly depends on the distribution of the original data and is susceptible to un-controlled factors.Without the guidance of prior knowledge,statistical correlations alone cannot correctly reflect the essential causal relations and may even introduce spurious correlations.As a result,researchers are now trying to enhance deep leaming-based methods with causal theory.Causal theory can model the intrinsic causal structure unaffected by data bias and effectively avoids spurious correlations.This paper aims to comprehensively review the existing causal methods in typical vision and vision-language tasks such as semantic segmentation,object detection,and image captioning.The advantages of causality and the approaches for building causal paradigms will be summarized.Future roadmaps are also proposed,including facilitating the development of causal theory and its application in other complex scenarios and systems.

Key words

causal reasoning/computer vision tasks/vision-language tasks/semantic segmentation/object detection

引用本文复制引用

基金项目

National Natural Science Foundation of China(62233005)

National Natural Science Foundation of China(62293502)

Programme of Introducing Talents of Discipline to Universities(the 111 Project)(B17017)

Fundamental Research Funds for the Central Universities(222202317006)

Shanghai AI Lab()

出版年

2024
中国科学:技术科学(英文版)
中国科学院

中国科学:技术科学(英文版)

CSTPCDEI
影响因子:1.056
ISSN:1674-7321
参考文献量2
段落导航相关论文