分割一切模型(SAM)在医学图像分割中的应用

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：医学图像分割是计算机辅助医疗流程中的关键步骤,精准的医学图像分割可以为诊断与治疗提供帮助.分割一切模型(SAM)利用提示驱动的基础大模型进行下游的分割任务,它的出现为医学图像分割提出了与神经网络不同的新方向.但是,SAM是以自然图像为基础的模型,对医学图像的处理效果还有待提高.本文介绍了 SAM在医学图像上直接应用的效果,并总结了将SAM应用到医学图像分割任务的研究工作.与此同时,介绍了本课题组在乳腺肿瘤数据集与孕妇骨盆数据集上进行的两个实验,验证了大模型经过大量数据微调后具有更好的泛化能力.半监督网络与SAM结合生成高质量的伪标签能够有效提高分割效果.虽然目前SAM在医学图像分割领域已取得较好效果,但进一步提升存在一定困难.本文最后分析了 SAM面临的挑战并讨论了 SAM在医学图像分割中的潜在发展方向,希望有助于医疗分割技术的进步.

外文标题：Application of Segment Anything Model in Medical Image Segmentation

外文摘要：Significance The application of deep neural networks to image segmentation is one of the most prevalent topics in medical imaging.As an initial step in computer-aided detection processes,medical image segmentation aims to identify contours or regions of interest within images,thereby providing valuable assistance to clinicians in image interpretation,surgical planning,and clinical decision-making.Deep neural networks,which leverage their powerful ability to learn complex image features,have demonstrated outstanding performance in medical image segmentation.However,the use of deep neural networks for medical image segmentation has two significant limitations.First,different medical imaging modalities and specific segmentation tasks exhibit diverse image characteristics,leading to the low generalization capabilities of deep neural networks,which are often tailored to specific tasks.Second,increasingly complex network architectures with notable segmentation efficacy demand significant amounts of annotated image data,particularly those that require laborious manual annotation by medical experts.With the rapid advancement of large-scale pretrained foundation models(LPFMs)in the field of artificial intelligence,an increasing number of tasks have achieved superior results through the fine-tuning of LPFMs.LPFMs are generic models trained on massive amounts of data and acquire foundational and versatile representational capabilities that can be transferred across different domains.Consequently,various downstream tasks can be easily fine-tuned using universal models.Considering the challenges in medical image segmentation,including low model generalization and difficulty in dataset acquisition,universal LPFMs are urgently needed in the field of medical image segmentation to facilitate breakthroughs in artificial intelligence applied to medical imaging.Since its introduction as a foundational large model in the field of natural image segmentation,the segment anything model(SAM)has been applied across various domains with remarkable results.Although SAM has demonstrated powerful capabilities in natural image segmentation,its direct application to medical image segmentation tasks has yielded less-than-satisfactory outcomes.This can be attributed to two main factors.First,the training datasets contain shortcomings.SAM lacks sufficient representation of medical images in its training data,and medical images often exhibit blurry edges,which differ significantly from the clear edges present in natural images.Second,the characteristics of SAM prompts play a crucial role in segmentation performance.Only by judiciously selecting prompt strategies can the full potential of SAM be realized.For these two reasons,significant efforts have been directed toward fine-tuning SAM,adapting SAM to three-dimensional(3D)medical datasets,expanding SAM functionalities,and optimizing prompting strategies.Comprehensive review articles have summarized these endeavors,such as the study by Zhang et al.,which extensively outlined advancements in fine-tuning SAM,expanding its functionalities,optimizing prompting strategies,and distilling the challenges faced by SAM in the field of medical image segmentation.However,a systematic summary of methods for applying SAM to 3D medical datasets is lacking.Zhang et al.primarily elaborated on the fine-tuning of SAM,its application to 3D medical datasets,and related automatic prompting strategies.Nevertheless,as research on SAM deepens and its performance across various datasets improves,efforts in fine-tuning SAM,adapting it to 3D datasets,and optimizing prompting strategies have become more sophisticated.In addition,SAM has been extended to integrate semi-supervised learning methods and has been applied to novel directions such as interactive clinical healthcare.To summarize comprehensively the progress of SAM adaptation to medical image segmentation as well as to address existing challenges and provide directions for further research,a review that specifically focuses on the application of SAM to medical image segmentation is essential.Progress This study extensively reviewed more than one hundred articles focusing on the utilization of SAM for medical image segmentation.Initially,this study furnished an exhaustive exposition of the SAM architecture and delineated its direct application to medical image datasets(Table 1).Then,an in-depth analysis of SAM's adaptation to medical image segmentation was conducted,emphasizing innovative refinements in fine-tuning techniques,SAM's integration into 3D medical datasets,and its amalgamation with semi-supervised learning methodologies(Fig.3)alongside other emerging avenues.Experimental evaluations on two proprietary medical image datasets validated the enhanced generalization capabilities of the large models after extensive data fine-tuning(Table 2).In addition,the study confirmed the effectiveness of combining SAM with semi-supervised networks in generating high-quality pseudo-labels,thereby augmenting the segmentation performance(Table 3).Finally,the study delved into the current limitations,identified areas requiring improvement,elucidated the challenges encountered in SAM's adaptation to medical image segmentation,and proposed future directions,including the construction of large-scale datasets,enhancement of multi-modal and multi-scale information processing,integration of SAM with semi-supervised network structures,and expansion of SAM's application in clinical settings.Conclusions and Prospects SAM is progressively being established as a potent asset in the field of medical image segmentation.In summary,although the integration of SAM into medical image segmentation holds great promise,it continues to face many challenges.Addressing these challenges requires a more comprehensive investigation and more refined approach,thus paving the way for effective implementation and further evolution of large-scale models in the domain of medical segmentation.

外文关键词：

segment anything modelmedical image segmentationfoundation modelsdeep learning

作者：

吴曈、胡浩基、冯洋、罗琼、徐栋、郑伟增、金能、杨琛、姚劲草

展开 >

作者单位：

浙江大学伊利诺伊大学厄巴纳香槟校区联合学院,浙江杭州 314400

浙江大学信息与电子工程学院,浙江杭州 310027

上海时代天使医疗器械有限公司天使研究院,上海 200433

浙江大学医学院附属妇产科医院产科,浙江杭州 310006

浙江省肿瘤医院,浙江杭州 331022

中国科学院杭州医学研究所,浙江杭州 310000

浙江大学医学院附属妇产科放射科,浙江杭州 310006

展开 >

关键词：

分割一切模型医学图像分割基础模型深度学习

出版年：

2024

DOI：

10.3788/CJL240614

中国激光

中国光学学会　中科院上海光机所

中国激光

CSTPCD北大核心

影响因子：2.204

ISSN：0258-7025

年,卷(期)：2024.51(21)