基于双分支多尺度特征融合的道路场景语义分割

Semantic segmentation of road scene based on dual-branch multi-scale feature fusion

扫码查看

原文链接

NETL
NSTL
维普
万方数据

中文摘要：针对实时场景图像语义分割网络模型难以在分割精度、模型参数量和推理速度中取得平衡的问题,提出了一种轻量级实时语义分割算法.首先,该算法以双分支结构作为该网络的基本结构,并在各个分支中分别对不同分辨率的特征图进行特征信息提取;其次,在高分辨率分支和低分辨率分支中分别加入改进的金字塔切分注意力模块和残差空洞金字塔模块,并在不同分支之间进行了双侧特征融合,以充分融合空间信息和语义信息;最后,设计了特征融合模块,并通过上采样操作对图像进行恢复,以实现图像语义分割.该算法在Cityscapes和Camvid数据集上进行验证,以5.02 M的参数量分别取得了 76.8％和70.5％的mIoU,分别达到了 56 fps和147 fps的运行速度.

外文摘要：Due to the imbalance between segmentation accuracy,model parameters,and inference speed in real-time scene semantic segmentation network models for scene images,a lightweight real-time semantic segmentation algorithm is proposed in this paper.Firstly,the algorithm employs a two-branch structure as the basic framework of the network,ex-tracting the feature information from different resolution feature maps in each branch.Secondly,improved pyramid split at-tention modules and residual atrous pyramid modules are incorporated into the high-resolution and low-resolution branches,respectively.Bilateral feature fusion is carried out between different branches to fully integrate spatial and semantic infor-mation.Finally,a feature fusion module is designed,and image semantic segmentation is achieved by restoring the image through upsampling operations.The algorithm achieves 76.8％and 70.5％mIoU with 5.02 M parameters and reaches up to 56 fps or 147 fps inference speed respectively in Cityscapes and Camvid datasets.

外文关键词：

Image processingReal-time semantic segmentationLightweightAttention moduleMulti-scale features

作者：

肖哲璇、陈辉、王硕

展开 >

作者单位：

安徽理工大学计算机科学与工程学院,安徽淮南 232001

关键词：

图像处理实时语义分割轻量级注意力模块多尺度特征

基金：

安徽省重点教学研究项目

项目编号：

2020jyxm0458

出版年：

2024

宁夏师范学院学报

宁夏师范学院

宁夏师范学院学报

影响因子：0.138

ISSN：1674-1331

年,卷(期)：2024.45(1)

参考文献量2