高精度实时语义分割算法框架:多通道深度加权聚合网络

High precision real-time semantic segmentation algorithm:Multi-channel deep weighted aggregation network

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：近年来随着深度学习技术的不断发展,涌现出各种基于深度学习的语义分割算法,然而绝大部分分割算法都无法实现推理速度和语义分割精度的兼得.针对此问题,提出一种多通道深度加权聚合网络(MCDWA_Net)的实时语义分割框架.该方法首先引入多通道思想,构建一种3通道语义表征模型,3通道结构分别用于提取图像的3类互补语义信息:低级语义通道输出图像中物体的边缘、颜色、结构等局部特征;辅助语义通道提取介于低级语义和高级语义的过渡信息,并实现对高级语义通道的多层反馈;高级语义通道获取图像中上下文逻辑关系及类别语义信息.之后,设计一种3类语义特征加权聚合模块,用于输出更完整的全局语义描述.最后,引入一种增强训练机制,实现训练阶段的特征增强,进而改善训练速度.实验结果表明,所提出方法在复杂场景中进行语义分割不仅有较快的推理速度,且有很高的分割精度,能够实现语义分割速度与精度的均衡.

外文摘要：In recent years,with the continuous development of deep learning technology,various semantic segmentation algorithms based on deep learning have emerged,but most of the segmentation algorithms cannot achieve high speed and high accuracy at the same time,and a real-time semantic segmentation framework for multi-channel depth-weighted aggregation networks(MCDWA_Net)is proposed to solve this problem.Firstly,the multi-channel idea is introduced to construct a three-channel semantic representation model,which is used to extract three types of complementary semantic information of the image:1)Low-level semantic channel outputs the local features such as the edge,color,and structure of the object in the image;2)Auxiliary semantic channel extracts the transition information between low-level semantics and high-level semantics,and realizes multi-layer feedback to the high-level semantic channel;3)Advanced semantic channel obtains context logical relationships and category semantic information in images.Then,a three-class semantic feature weighted aggregation module is designed to output a more complete global semantic description.Finally,an enhancement training mechanism is introduced to realize the feature enhancement in the training stage,thereby improving the training speed.Experimental results show that the proposed method not only has fast inference speed,but also has high segmentation accuracy in complex scenes,which can achieve the balance of semantic segmentation speed and accuracy.

外文关键词：

deep learningsemantic segmentationsemantic featurecontext informationdepth fusion

作者：

齐咏生、陈培亮、高学金、董朝轶、魏淑娟

展开 >

作者单位：

内蒙古工业大学电力学院,呼和浩特 010080

大规模储能技术教育部工程研究中心,呼和浩特 010080

内蒙古自治区高等学校智慧能源技术与装备工程研究中心,呼和浩特 010080

北京工业大学信息学部,北京 100080

展开 >

关键词：

深度学习语义分割语义特征上下文信息深度融合

基金：

国家自然科学基金内蒙古自治区科技计划内蒙古自治区科技计划内蒙古自治区自然科学基金

项目编号：

622413092020GG0282021GG1642020MS05029.2021MS06018

出版年：

2024

DOI：

10.13195/j.kzyjc.2022.1699

控制与决策

东北大学

控制与决策

CSTPCD北大核心

影响因子：1.227

ISSN：1001-0920

年,卷(期)：2024.39(5)

参考文献量26