计算机辅助设计与图形学学报2024,Vol.36Issue(10) :1570-1582.DOI:10.3724/SP.J.1089.2024.20043

结合多分支结构与门控机制的高分辨率语义分割方法

A High Resolution Semantic Segmentation Method via Multi-Branch Structure and Gating Mechanism

杜可 叶春明
计算机辅助设计与图形学学报2024,Vol.36Issue(10) :1570-1582.DOI:10.3724/SP.J.1089.2024.20043

结合多分支结构与门控机制的高分辨率语义分割方法

A High Resolution Semantic Segmentation Method via Multi-Branch Structure and Gating Mechanism

杜可 1叶春明1
扫码查看

作者信息

  • 1. 上海理工大学管理学院 上海 200093
  • 折叠

摘要

针对HRNetv2等多分支结构网络在语义分割任务中无法有效地融合多层次特征的问题,提出一种基于门控机制的新型多层次特征融合方法.首先,构建门控融合单元,利用门控机制有选择性地融合多个分支的特征信息;其次,提出自底向上的融合方法,通过阶梯式地传播语义丰富的高级特征与细节饱满的低级特征来增强每一条分支的特征表示;最后将各个分支的特征在通道维度进行拼接,获得预测输出并采用双线性插值算法恢复至原图像尺寸.实验结果表明,仅需增加少量参数,该方法在PASCAL VOC 2012+Aug和Cityscapes数据集上的mIoU分别取得77.01%和80.43%,相较于HRNetv2-W48分别提升了 1.14个百分点和1.92个百分点,同时性能超越诸多基线模型.

Abstract

In order to solve the problem that HRNetv2 and other multi-branch structure networks cannot ef-fectively fuse multi-level features in semantic segmentation tasks,a new multi-level feature fusion method based on the gating mechanism is proposed.Firstly,a gated fusion unit is constructed to fuse the feature in-formation of multiple branches selectively.Secondly,a bottom-up fusion method is adopted to progressively enhance the feature representation of each branch by means of spreading semantically high-level features and detailed low-level features.Finally,features of branches are concatenated channel-wisely to output the predicted mask,and the bilinear interpolation algorithm is used to restore the original image size.Experi-ments show that the proposed method with only a few parameters achieves 77.01%mIoU and 80.43%mIoU in PASCAL VOC 2012+Aug and Cityscapes respectively,increases by 1.14 percentage points and 1.92 per-centage points compared with HRNetv2-W48,and outperforms many baseline models.

关键词

多分支结构/多层次特征融合/门控机制/自底向上

Key words

multi-branch structure/multi-level feature fusion/gating mechanism/bottom-up

引用本文复制引用

出版年

2024
计算机辅助设计与图形学学报
中国计算机学会

计算机辅助设计与图形学学报

CSTPCDCSCD北大核心
影响因子:0.892
ISSN:1003-9775
段落导航相关论文