联合语义分割和深度估计的交通场景感知算法

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：受不同像素级视觉任务间的特征信息能够相互指导和优化的思路启发,基于多任务学习理论提出联合语义分割和深度估计的交通场景感知算法.提出双向跨任务注意力机制,实现任务间的全局相关性显式建模,引导网络充分挖掘和利用任务间互补模式信息.构建多任务Transformer,增强特定任务特征的空间全局表示,实现跨任务全局上下文关系的隐式建模,促进任务间互补模式信息的融合.设计编-解码融合上采样模块来有效融合编码器蕴含的空间细节信息,生成精细的高分辨率特定任务特征.在Cityscapes数据集上的实验结果表明,所提算法的语义分割平均交并比达到 79.2%,深度估计均方根误差为 4.485,针对 5 类典型交通参与者的距离估计平均相对误差为 6.1%,能够以比现有主流算法更低的计算复杂度获得更优的综合性能.

外文标题：Traffic scene perception algorithm with joint semantic segmentation and depth estimation

外文摘要：Inspired by the idea that feature information between different pixel-level visual tasks can guide and optimize each other,a traffic scene perception algorithm based on multi-task learning theory was proposed for joint semantic segmentation and depth estimation.A bidirectional cross-task attention mechanism was proposed to achieve explicit modeling of global correlation between tasks,guiding the network to fully explore and utilize complementary pattern information between tasks.A multi-task Transformer was constructed to enhance the spatial global representation of specific task features,implicitly model the cross-task global context relationship,and promote the fusion of complementary pattern information between tasks.An encoder-decoder fusion upsampling module was designed to effectively fuse the spatial details contained in the encoder to generate fine-grained high-resolution specific task features.The experimental results on the Cityscapes dataset showed that the mean IoU of semantic segmentation of the proposed algorithm reached 79.2%,the root mean square error of depth estimation was 4.485,and the mean relative error of distance estimation for five typical traffic participants was 6.1%.Compared with the mainstream algorithms,the proposed algorithm can achieve better comprehensive performance with lower computational complexity.

外文关键词：

perception of traffic environmentmulti-task learningsemantic segmentationdepth estimationTransformer

作者：

范康、钟铭恩、谭佳威、詹泽辉、冯妍

展开 >

作者单位：

厦门理工学院福建省客车先进设计与制造重点实验室,福建厦门 361024

厦门大学航空航天学院,福建厦门 361102

关键词：

交通环境感知多任务学习语义分割深度估计 Transformer

基金：

福建省自然科学基金福建省自然科学基金

项目编号：

2023J0114392019J01859

出版年：

2024

DOI：

10.3785/j.issn.1008-973X.2024.04.004

浙江大学学报(工学版)

浙江大学

浙江大学学报(工学版)

CSTPCD北大核心

影响因子：0.625

ISSN：1008-973X

年,卷(期)：2024.58(4)

参考文献量27