小样本语义分割研究现状与分析

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：传统语义分割任务通常是数据驱动的,需要大规模密集标注样本训练,并且不能泛化到新类,因此在实际应用中受到很大限制.为了缓解传统语义分割中数据匮乏和泛化能力差的问题,人们提出小样本语义分割任务,在未见类别仅提供少量密集标注样本的情况下实现新类分割,在医疗图像分割、自动驾驶等应用领域扮演着重要的角色,已成为计算机视觉领域的重要研究方向之一.本文从基础知识、模型算法和拓展应用等方面对自然图像领域的小样本语义分割研究展开调查,具体包含以下内容:(1)介绍了小样本语义分割的背景知识,包括它的由来、核心思想、概念知识、存在挑战、数据集和性能评价指标;(2)详细分析和比较当前小样本语义分割算法,根据推理过程中是否存在梯度回传将其分为基于优化和基于度量学习的方法,并归纳了其发展现状和不同算法的优缺点;(3)介绍了小样本语义分割与其他技术融合的任务,包括小样本实例分割、广义小样本分割、增量小样本分割、弱监督小样本分割及跨域小样本分割;(4)讨论了小样本语义分割任务仍存在的问题和未来展望.

外文标题：Research Status and Analysis of Few-Shot Semantic Segmentation

外文摘要：The traditional semantic segmentation task is usually data-driven,which requires large-scale intensive annotation data for training.And it cannot be generalized to novel class segmentation which means that when novel class samples emerge,it is necessary to collect a large amount of data to retrain the model.So these issues severely limit its practical application.In order to alleviate the issues of data scarcity and poor generalization ability in traditional semantic segmentation task,the few-shot semantic segmentation(FSS)task has been proposed,which can utilize the past accumulated knowledge to achieve the segmentation of novel classes in images that are not included in the training process with only a limited number of densely labeled samples.FSS can significantly reduce the need for data annotation and demonstrates good generalization performance.It has attracted widespread attention from the academic community.This research filed is crucial for practical applications where data availability is limited and difficult,such as medical field,remote sensing filed and so on.In recent years,there has been rapid development and progress in the field of few-shot semantic segmentation.A large number of novel and high-performing methods have emerged,urgently requiring a comprehensive review to summarize and sort out these algorithms.This paper investigates the research on few-shot semantic segmentation in the field of natural images from the aspects of basic knowledge,algorithms summarization,extended applications and future development.We firstly introduce the basic knowledge of few-shot semantic segmentation task,including its origin,core ideas,conceptual knowledge,challenges,dataset benchmarks,and evaluation metrics.Then we have analyzed,compared the current few-shot semantic segmentation methods in detail,and summarized them accordingly.Specifically,we divide those methods into optimization-based and metric-learning-based methods according to whether there is gradient backpropagation during inference.When it comes to optimization-based methods,there is a need for gradient backpropagation to optimize the model during inference,whereas metric-learning-based approaches do not require any optimization of the model during inference.Instead,they utilize densely labeled images to obtain information about the categories to be segmented and use this class-specific information to guide the segmentation process.This allows for a more efficient and direct utilization of the labeled data without the need for further model training or optimization.We have conducted a comparison of the introduced few-shot semantic segmentation algorithms,listing the best performance achieved by each algorithm and providing quantitative analysis on three datasets.The comparison was made from three perspectives:the performance comparison on each dataset under different evaluation metrics,the performance comparison of a single algorithm across different dataset benchmarks,and the results comparison of different algorithms on the PASCAL-5i dataset.In addition,we introduce some tasks of integrating few-shot semantic segmentation and other technologies,such as few-shot instance segmentation,generalized few-shot semantic segmentation,incremental few-shot semantic segmentation,weakly supervised few-shot semantic segmentation,and cross-domain few-shot semantic segmentation.Finally,we discuss the existing problems and future development trend of few-shot semantic segmentation.We believe that this paper will help researchers better understand the current research status of few-shot semantic segmentation,provide a broader perspective for future studies,and promote further development and innovation in this field.

外文关键词：

few-shot learningimage semantic segmentationmeta learningtransfer learningdeep learning

作者：

陈善娟、于云龙、李英明

展开 >

作者单位：

浙江大学信息与电子工程学院杭州 310058

关键词：

小样本学习图像语义分割元学习迁移学习深度学习

基金：

浙江省自然科学基金项目国家自然科学基金国家自然科学基金浙江省省重点研发计划项目浙江省省重点研发计划项目宁波市重点研发计划项目

项目编号：

LD24F02001662002320U19B20432023C010432021C011192023Z236

出版年：

2024

DOI：

10.11897/SP.J.1016.2024.02417

计算机学报

中国计算机学会中国科学院计算技术研究所

计算机学报

CSTPCD北大核心

影响因子：3.18

ISSN：0254-4164

年,卷(期)：2024.47(10)