基于多通道交叉注意力融合的三维目标检测算法

扫码查看

原文链接

万方数据
维普

中文摘要：针对现有单阶段三维目标检测算法对点云下采样特征利用方式单一、特征对长程上下文信息的聚合程度无法满足算法性能提升需求的问题,本文提出了基于多通道交叉注意力融合的单阶段三维目标检测算法.首先,设计通道交叉注意力模块用于融合下采样特征,可基于交叉注意力机制在通道层面上增强多尺度特征对不同感受野下长程空间信息的表达能力;然后,提出级联特征激励模块,结合原始下采样特征对通道交叉注意力加权特征进行级联激励,提升算法对关键空间特征的学习能力.在公共自动驾驶数据集KITTI上进行了大量实验并与主流算法对比,本文算法作为单阶段目标检测算法,在车辆类别 3 个难度级别上的检测准确率分别为 91.34%、79.85%和 75.98%,较基线算法分别提升了 4.83%、3.26%和 3.32%.实验结果证明了本文算法及所提模块在三维目标检测任务上的有效性和先进性.

外文标题：3D object detection algorithm with multi-channel cross attention fusion

外文摘要：To solve the problems that the existing single-stage 3D object detection algorithm utilizes point cloud down-sampling features in a single way and the degree of aggregation of features for the long-range contextual information cannot meet the requirement of enhancing the algorithm performance,we propose a single-stage 3D object detection al-gorithm based on multi-channel cross attention fusion.First,the channel-wise cross attention module is designed to fuse the down sampled features,which can enhance the expression ability of multi-scale features for the long-range spatial information under different receptive field based on the cross attention mechanism.Then,a cascade feature excitation module is proposed to combine the original downsampling features to cascade channel-wise cross attention weighted features to enhance the algorithm's learning ability for key spatial features.Extensive experiments were conducted on the public autonomous driving dataset KITTI and compared with mainstream algorithms.As a single-stage algorithm,the detection accuracy was 91.34%,79.85%and 75.98%for the three difficulty levels of car categories,which were 4.83%,3.26%and 3.32%better than the baseline algorithm.The experimental results demonstrate the effectiveness and ad-vancement of the algorithm and the proposed modules for 3D object detection task.

外文关键词：

3D point cloudautonomous drivingLiDARdeep learning3D object detectionpillarcross attentionsingle-stage algorithm

作者：

鲁斌、杨振宇、孙洋、刘亚伟、王明晗

展开 >

作者单位：

华北电力大学控制与计算机工程学院,河北保定 071000

华北电力大学河北省能源电力知识计算重点实验室,河北保定 071000

关键词：

三维点云自动驾驶激光雷达深度学习三维目标检测柱体素交叉注意力单阶段算法

基金：

河北省重点研发计划项目河北省在读研究生创新能力培养资助项目

项目编号：

20310103DCXZZBS2023153

出版年：

2024

DOI：

10.11992/tis.202305029

智能系统学报

中国人工智能学会　哈尔滨工程大学

智能系统学报

CSTPCD北大核心

影响因子：0.672

ISSN：1673-4785

年,卷(期)：2024.19(4)

参考文献量3