基于鸟瞰图的多任务端到端 3D目标检测方法

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：由于在自动驾驶过程中激光雷达扫描过程中受到空间限制,采集到的点云数据损失了过多的信息.在聚合表示方法的基础上,提出了基于鸟瞰图的多任务端到端3D目标检测方法.首先,利用纹理特征提取器与语义特征提取器对特征金字塔的高级特征层与低级特征层进行特征融合,获得扩展的特征金字塔层,使得自动驾驶过程中的中等目标(骑自行车的人)与小目标(行人)不再耦合在同一级别的特征金字塔层,提高了点云拓扑信息的区域细节性.其次,在损失函数中引入用于分类的Focal loss与用于回归的CIOU loss,改善了正负样本比例,对目标检测框进行了约束,使得目标框在回归过程中能够以更高精度收敛.实验结果表明,提出的方法在自动驾驶中小目标检测中具有更强的检测能力与更高的检测精度.

外文标题：End-to-End Multi-Task 3D Object Detection Method Based on Bird Eye·s View Images

外文摘要：Due to spatial limitations during the scanning process of LiDAR during autonomous driving,the collect-ed point cloud data loses too much information.Based on compressed representation,a novel multitask end to end 3D object detection model in Bird's Eye View is presented.Firstly,texture feature extractor and semantic feature extractor were used to fuse the high-level and low-level feature layers of a feature pyramid network(FPN)to obtain an extend-ed feature pyramid layer,which eliminated the coupling of medium target(cyclist)and small target(pedestrian)at the same level of feature pyramid layer during the auto-driving process,improving the regional detail of the point cloud topology information.Secondly,the Focal loss and CIOU loss were introduced in the loss function to improve the pro-portion of negative sample and limit the bounding box.As a result,the bounding box can converge more accurately in the regression process.Finally,The experimental results show that the proposed method has better detection ability and higher detection accuracy in small and medium targets detection of auto-driving.

外文关键词：

Automatic drivingBird eye's viewMultiscale object detectionLow coupling pyramid layers

作者：

许可、李文卓

展开 >

作者单位：

沈阳建筑大学电气与控制工程学院,辽宁沈阳 110168

关键词：

自动驾驶鸟瞰图多尺度目标检测低耦合金字塔层

基金：

国家重点研发计划项目

项目编号：

2021YFB3201600

出版年：

2024

计算机仿真

中国航天科工集团公司第十七研究所

计算机仿真

CSTPCD

影响因子：0.518

ISSN：1006-9348

年,卷(期)：2024.41(1)

参考文献量20