首页|融合卷积和Transformer的多尺度皮肤病变分割算法

融合卷积和Transformer的多尺度皮肤病变分割算法

扫码查看
皮肤病变自动分割对协助医生临床诊断、治疗及术后观察都具有非常重要的意义.现有卷积擅长建立局部相关性但无法捕获像素长程依赖关系,而Tansformer可以建立特征信息的全局依赖关系但会造成局部细节信息丢失.因此,提出了一种融合卷积和Transformer的多尺度自动分割网络.采用ResNet34作为基础编码块,利用其金字塔结构建立病灶的多级局部相关性;采用Swin Transformer模块捕获上下文特征的长程依赖关系,考虑到病灶形状多变、大小不一等情况,提出多尺度特征聚合模块来进一步提取上下文特征多尺度信息;采用具有注意力机制的解码块逐步融合编码块提取到的多级语义信息.实验结果表明,所提模型在ISIC 2017数据集上测试所得的Dice系数分别高达89.55%,FPS高达83,与其他先进模型相比,本模型参数更少、推理速度更快、精度更高.
Research on Multi-scale Skin Lesion Segmentation Algorithm Based on Convolution and Transformer
Automatic segmentation of skin lesions is of great significance to assist doctors in clinical diagnosis,treatment,and postoperative observation.Existing convolutions are good at establishing local correlations but cannot capture pixel long-range dependencies,while Tansformer can establish global dependencies of feature information but can cause the loss of local details.Therefore,a multi-scale automatic segmentation network that combines convolution and Transformer is proposed.Firstly,ResNet34 is used as the basic coding block,using its pyramid structure to establish multi-level local correlation of lesions;secondly,the Swin Transformer module is used to capture the long-term dependence of context features.Considering the variable shape and size of lesions,a multi-scale feature aggregation module is proposed to further extract multi-scale information of context features;finally,a decoding block with attention mechanism is used to gradually fuse the multi-level semantic information extracted from the coding block.The experimental results show that the Dice coefficients obtained by the proposed model tested on the ISIC 2017 dataset are as high as 89.55%,and the FPS is as high as 83.Compared with other advanced models,this model has fewer parameters,faster reasoning speed,and higher accuracy.

image processingSwin Transformermulti-scale feature aggregation moduleattention mechanism

蒋新辉、李筱林、韦春苗、覃镇锋

展开 >

柳州铁道职业技术学院通信与物联网学院,广西柳州 545616

广西生态工程职业技术学院汽车与信息工程学院,广西柳州 545004

图像处理 Swin Transformer 多尺度特征聚合模块 注意力机制

2022年度广西高校中青年项目2023年度广西高校中青年项目

2022KY14142023KY1264

2024

无线电工程
中国电子科技集团公司第五十四研究所

无线电工程

影响因子:0.667
ISSN:1003-3106
年,卷(期):2024.54(3)
  • 32