融合注意力机制的DeeplabV3+服装图像分割方法

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：针对在服装图像语义分割中存在由服装颜色、纹理、背景以及多目标遮挡导致的边缘分割粗糙和分割精度低等问题,文中基于Deeplabv3+框架,提出了一种图像语义分割算法(FFDNet).首先,模型的骨干网络采用ResNet101网络,并添加通道空间注意力模块(Feature-Enhanced Attention Module,FEAM),通过对特征图加权来挖掘并增强特征信息,提高网络表达能力.其次引入特征对齐模块(Feature Align Module,FAM)作为一种新的上采样方式,解决不同尺度特征融合之间特征未对齐导致分割错误且效率低的问题,以此提高对服装图像分割的准确性和鲁棒性.最后,FFDNet在Deepfashion2和PASCAL VOC 2012数据集上的平均交并比分别达到55.2％和79.4％;在参数量方面,该模型相比原模型在Deepfashion2上仅增加了0.61 MB.与其他现有经典模型对比,其分割性能更优,能有效捕获图像局部细节信息,减少像素分类错误.

外文标题：Clothing Image Segmentation Method Based on Deeplabv3+Fused with Attention Mechanism

外文摘要：Aiming at the problems of rough edge segmentation and low segmentation accuracy caused by color,texture,back-ground and multi-object occlusion in clothing image segmentation,an image semantic segmentation method(FFDNet)based on Deeplabv3+with attention mechanism is proposed.Firstly,the backbone network of the model uses the ResNet101 network.The feature-enhanced attention module(FEAM)is added at the end of it.The feature map is weighted from the two dimensions of channel and spatial to mine and enhance the feature information and optimize the segmentation edge to improve network clarity.Secondly,a feature align module(FAM)is introduced as a novel upsampling method to address the problem of segmentation er-rors and low efficiency caused by misalignment between features during the fusion of different scale features,so as to to improve the accuracy and robustness of clothing image segmentation.Finally,the mean intersection over union of the proposed method reaches 55.2％and 79.4％on Deepfashion2 and PASCAL VOC2012,respectively.In terms of parameter size,the model only in-creases by 0.61MB compared to the original model on Deepfashion2.The segmentation performance of the FFDNet is superior to the existing state-of-the-art network models,which can effectively capture image local detail information and reduce pixel classifi-cation errors.

外文关键词：

Clothing imageSemantic segmentationAttention mechanismDeeplabv3+networkFeature alignment

作者：

肖雅慧、张自力、胡新荣、彭涛、张俊

展开 >

作者单位：

武汉纺织大学计算机与人工智能学院武汉 430200

湖北省服装信息化工程技术研究中心武汉 430200

武汉工程大学计算机科学与工程学院武汉 430205

关键词：

服装图像语义分割注意力机制 Deeplabv3+网络特征对齐

基金：

湖北省教育厅科学技术研究项目

项目编号：

B2017066

出版年：

2024

DOI：

10.11896/jsjkx.230900153

计算机科学

重庆西南信息有限公司（原科技部西南信息中心）

计算机科学

CSTPCD北大核心

影响因子：0.944

ISSN：1002-137X

年,卷(期)：2024.51(z1)

参考文献量29