融合多层感知注意力的电极微观图像分割方法

扫码查看

原文链接

国家科技期刊平台
NETL
NSTL
万方数据
维普

中文摘要：针对氮氧传感器电极微观图像存在的物质边缘模糊、伪影、灰度不均等问题,将U-Net作为基础模型,提出融合多层感知注意力的电极微观图像语义分割方法.首先对U-Net编码层的不同尺度输出特征图使用3×3卷积进行降维,利用双线性插值统一特征尺度,以实现多尺度特征融合,增强特征信息提取能力并补偿编码下采样中的特征损失;其次通过加入空间金字塔池化来提取多尺度信息并通过1×1卷积减小计算量,同时提出多层感知注意力模块,以捕获主干特征图和增强语义信息特征图的空间位置与通道依赖关系;最后计算不同语义信息特征图的相似度关系,结合交叉熵损失提出具有捕获空间相似性能力的损失函数,在训练过程中对关键信息进行监督,辅助主干特征图学习空间位置信息,增强分割性能.实验结果表明,该方法的类别平均像素准确率为96.75%,平均交并比为94.04%,微观F1分数为96.92%,浮点运算次数为7.78×109,网络所含参数量为8.08×106.相对U-Net、SegNet等模型,该方法在提高少量模型复杂度的情况下,能有效改善边缘模糊及物质伪影问题,捕获空间位置与通道信息,保留图像细节特征,提高分割准确率.

外文标题：Electrode Microscopic Image Segmentation Method by Fusing Multi-layer Perceptual Attention

外文摘要：To address the problems of blurred material edges,artifacts,and uneven grayscale in electrode microscopic images of NOx sensors,an electrode microscopic image semantic segmentation method that fuses multi-layer perceptual attention is proposed,in which U-Net is the base model.First,different scale output feature maps of the U-Net encoding layer with a 3×3 convolution are used to reduce dimensionality.Furthermore,bilinear interpolation is used to unify feature scales to achieve multi-scale feature fusion,enhance feature information extraction,and compensate for feature loss from encoding downsampling.Second,by adding spatial pyramid pooling to extract multi-scale information and employing a 1×1 convolution to reduce the calculation,a multi-layer perceptual attention module is proposed to capture the spatial position and channel dependence of the backbone feature map and the feature map with enhanced semantic information.Finally,a loss function with the ability to capture spatial similarity is proposed based on the similarity relationship of feature maps with different semantic information combined with cross-entropy loss.The key information is supervised during the training process to assist the backbone feature map to learn spatial position information and enhance the segmentation performance.The experimental results indicate that the Mean Pixel Accuracy(MPA)of the proposed method is 96.75%,the Mean Intersection over Union(MIoU)is 94.04%,Micro-F1 is 96.92%,FLOPs is 7.78×109,and the number of parameters contained in the network is 8.08×106.Compared with models such as U-Net and SegNet,the proposed method can effectively address problems of edge blurring and material artifacts while increasing a little model complexity.Furthermore,it can capture spatial position and channel information,preserve detailed features of the image,and improve segmentation accuracy.

外文关键词：

electrodemicroscopic imageNOx sensorsemantic segmentationperceptual attention

作者：

徐威、付晓薇、李曦、汪尧坤

展开 >

作者单位：

武汉科技大学计算机科学与技术学院,湖北武汉 430065

智能信息处理与实时工业系统湖北省重点实验室,湖北武汉 430065

华中科技大学人工智能与自动化学院,湖北武汉 430074

关键词：

电极微观图像氮氧传感器语义分割感知注意力

基金：

国家自然科学基金国家自然科学基金广东省重点研发计划项目深圳科技创新基础研究重点项目

项目编号：

61873323U20662022022B0111130004JCYJ20210324115606017

出版年：

2024

DOI：

10.19678/j.issn.1000-3428.0067208

计算机工程

华东计算技术研究所　上海市计算机学会

计算机工程

CSTPCD北大核心

影响因子：0.581

ISSN：1000-3428

年,卷(期)：2024.50(1)

被引量1
参考文献量1