Deep Learning-Based Image Matching: Methods, Applications, and Challenges

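Abstract (translated from the Chinese): Image matching aims to establish point correspondences between images and is a key step in many computer vision tasks. In recent years, with the development of deep learning, image matching methods have shifted from relying mainly on hand-crafted features to methods based on deep networks. Deep learning-based image matching methods have shown excellent performance on multiple standard datasets and are driving progress in a number of related applications. Centered on several key problems in image matching, namely keypoint detection, keypoint description, dense point matching, and mismatch removal, this paper gives a systematic summary of deep learning-based image matching methods. It first analyzes typical deep learning-based methods and key techniques in the field, then introduces several typical applications closely related to image matching along with an analysis of their current status. Finally, drawing on this analysis of technical developments in image matching and on the authors' long-term research experience in the field, the paper presents the main challenges currently facing image matching and its future development trends.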

English title: Image Matching in Deep Learning Era: Methods, Applications and Challenges

English abstract: Image matching is a crucial technique in computer vision that focuses on establishing point correspondences between two images of the same scene. It seeks points in one image that correspond to points in another, enabling a wide range of computer vision tasks that analyze multiple images of the same object or scene taken from different viewpoints or at different times, including but not limited to 3D reconstruction, motion tracking, image stitching for panoramic views, and visual localization. Traditionally, this process has relied heavily on hand-crafted keypoint detectors and local descriptors, i.e., algorithms designed to locate and describe discriminative features within a local image region while achieving invariance to scale, rotation, and changes in lighting and perspective. In recent years, with the rapid development of deep learning in many areas of computer vision, image matching methods have shifted from hand-crafted designs to deep learning. Numerous deep learning-based image matching techniques have emerged, showing promising results across a wide range of benchmarks. This has also significantly accelerated the development of many downstream applications of image matching, notably structure from motion, visual localization, and simultaneous localization and mapping (SLAM). This paper aims to provide a comprehensive overview of the deep learning-based image matching methods that have emerged in recent years. Organized around the core problems of image matching, namely keypoint detection, local feature description, dense matching, and mismatch removal, it summarizes the deep learning approaches devised to tackle them. This systematic review highlights the advancements in the field and sheds light on how these methods have reshaped image matching in terms of accuracy, efficiency, and reliability. Specifically, it first gives the problem definition of image matching and describes the main challenges. It then examines each problem associated with image matching, analyzing typical and representative methods together with the key techniques that deep learning uses to address them. Moreover, several closely related downstream tasks of image matching are described along with their state of the art: 3D reconstruction/structure from motion, image-based localization, and simultaneous localization and mapping. The paper also describes popular benchmarks for image matching and its downstream tasks. Finally, it discusses the remaining challenges and future research directions. The paper is intended as a resource for researchers and engineers in related fields, helping them quickly grasp the fundamentals, challenges, key technological advances, and current state of the art in image matching. It can also serve as a reference on research directions and dataset resources for researchers entering the field, and it aims to stimulate further exploration and innovation that advance image matching and its applications in computer vision.

English keywords:

image matching; feature point matching; dense matching; 3D reconstruction; visual localization; simultaneous localization and mapping; deep learning

Authors:

Kong Qingqun (孔庆群), Wu Fuchao (吴福朝), Fan Bin (樊彬)

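Author affiliations:

Institute of Automation, Chinese Academy of Sciences, Beijing 100190

University of Chinese Academy of Sciences, Beijing 100049

School of Intelligence Science and Technology, University of Science and Technology Beijing, Beijing 100083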

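Keywords (translated from the Chinese):

image matching; feature point matching; dense matching; 3D reconstruction; visual localization; simultaneous localization and mapping; deep learning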

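Funding:

Beijing Natural Science Foundation; National Natural Science Foundation of China; National Natural Science Foundation of China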

Project numbers:

4202073; 62222302; 61876180

Publication year:

2024

DOI:

10.11897/SP.J.1016.2024.01485

Journal: Chinese Journal of Computers (计算机学报)

Sponsors: China Computer Federation; Institute of Computing Technology, Chinese Academy of Sciences

Indexed in: CSTPCD; Peking University Core Journals (北大核心)

Impact factor: 3.18

ISSN: 0254-4164

Year, volume (issue): 2024, 47(7)