3D Object Detection Based on Voxel Self-Attention Auxiliary Networks

Abstract: LiDAR object detection algorithms that rely on convolutional neural networks (CNNs) understand the spatial structure of autonomous driving scenes only shallowly, which degrades detection performance. To address this problem, a voxel self-attention auxiliary (VSAA) network is proposed that enhances feature extraction and can be applied directly to most voxel-based detection algorithms. First, building on voxel feature encoding, the VSAA network constructs a voxel hash table to encode the voxels a second time, which makes searching for relevant voxels more efficient in the subsequent self-attention computation. Second, the VSAA network applies the self-attention mechanism at the voxel level to capture rich global information and deep contextual semantic information. Finally, the VSAA network is applied to the baseline algorithms SECOND and PV-RCNN, yielding the VA-SECOND and VA-PVRCNN algorithms; fusing the VSAA and CNN features compensates for the small receptive field of the CNN and strengthens the detector's understanding of the whole spatial scene. Experiments on the KITTI dataset show that, compared with the baseline algorithms, VA-SECOND and VA-PVRCNN improve the average detection accuracy over all detection targets by 1.16 and 1.54 percentage points, respectively, which demonstrates the effectiveness of the VSAA network.
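The two core ideas named in the abstract, a voxel hash table for fast lookup and self-attention computed over voxels, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not specify the hash function, the attention range, or the projection layers, so the dictionary-based table, the cubic neighbourhood `radius`, the single attention head, and the function names below are all assumptions made for illustration.

```python
import numpy as np

def build_voxel_hash(coords):
    """Map each integer voxel coordinate (x, y, z) to its row index.
    A Python dict stands in for the voxel hash table (assumption)."""
    return {tuple(c): i for i, c in enumerate(coords.tolist())}

def neighbor_indices(coord, table, radius=1):
    """Find non-empty voxels in a cubic window using O(1) hash lookups."""
    x, y, z = coord
    found = []
    for dx in range(-radius, radius + 1):
        for dy in range(-radius, radius + 1):
            for dz in range(-radius, radius + 1):
                j = table.get((x + dx, y + dy, z + dz))
                if j is not None:
                    found.append(j)
    return found

def voxel_self_attention(coords, feats, w_q, w_k, w_v, radius=1):
    """Single-head scaled dot-product attention over neighbouring voxels.
    The local window is an illustrative choice, not the paper's design."""
    table = build_voxel_hash(coords)
    q, k, v = feats @ w_q, feats @ w_k, feats @ w_v
    scale = np.sqrt(q.shape[1])
    out = np.zeros_like(v)
    for i, c in enumerate(coords.tolist()):
        nbr = neighbor_indices(c, table, radius)  # always contains i itself
        scores = k[nbr] @ q[i] / scale
        weights = np.exp(scores - scores.max())   # numerically stable softmax
        weights /= weights.sum()
        out[i] = weights @ v[nbr]
    return out

# Toy example: two adjacent voxels and one isolated voxel.
coords = np.array([[0, 0, 0], [0, 0, 1], [5, 5, 5]])
feats = np.random.default_rng(0).normal(size=(3, 4))
eye = np.eye(4)  # identity projections keep the example easy to check
out = voxel_self_attention(coords, feats, eye, eye, eye)
```

In this toy setup the isolated voxel attends only to itself, so its output equals its input feature, while the two adjacent voxels exchange information. In a detector, such attention features would then be fused with the CNN features, as the abstract describes.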

Keywords:

LiDAR; object detection; autonomous driving; voxel; self-attention

Authors:

Cao Jie (曹捷), Peng Yiqiang (彭忆强), Fan Likang (樊利康), Wang Longfei (王龙飞)

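Affiliations:

School of Automobile and Transportation, Xihua University, Chengdu 610039, Sichuan, China

Vehicle Measurement, Control and Safety Key Laboratory of Sichuan Province, Xihua University, Chengdu 610039, Sichuan, China

Sichuan Engineering Research Center for Intelligent Control and Simulation Test Technology of New Energy Vehicles, Chengdu 610039, Sichuan, China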

Publication year:

2024

DOI:

10.3788/LOP240923

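Laser & Optoelectronics Progress (激光与光电子学进展)

Shanghai Institute of Optics and Fine Mechanics, Chinese Academy of Sciences

Indexed in CSTPCD; Peking University Core Journals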

Impact factor: 1.153

ISSN: 1006-4125

Year, volume (issue): 2024, 61(24)