基于三维点云与图像融合的轨道交通场景行人检测方法

Pedestrian detection method in rail transit scenes based on fusion of 3D point clouds and images

贺佳¹

扫码查看

作者信息

1. 国能包神铁路集团有限责任公司,内蒙古包头 014010
折叠

摘要

在轨道交通场合由于行人非法侵限造成的安全事故时有发生,严重影响列车的安全运行.利用单一传感器数据进行行人检测存在检测结果召回率低、结果缺乏类别信息或方位信息等问题,无法满足实际现场需求.针对这些问题,文章提出一种基于三维点云与图像融合的轨道交通场景行人检测方法.该方法首先利用构建的轨道交通场景行人数据集训练的深度学习模型分别对三维点云与图像进行行人检测,在此基础上根据靶标在三维点云与图像中的空间位置一致性原理求解出激光雷达与相机之间的旋转平移矩阵,最后将三维点云目标检测结果投影至图像坐标系.为解决多个相邻目标之间误匹配、检测结果之间一对多的问题,通过计算2种检测结果之间的交并比和中心点距离作为融合的约束条件,进而更精确地检测行人.在现场采集的数据集上的试验结果表明,相比于三维点云与图像检测结果,在满足时效性的同时,该方法在检测精度相差不大的情况下,召回率分别提高了 4.5％和5.5％,能够有效减少由于行人漏检可能造成的安全事故,满足在列车实际运营过程中对行人检测的需求.

Abstract

Safety incidents caused by pedestrians illegally intruding onto railway tracks occur frequently in rail transit scenes,sig-nificantly affecting the safe operation of trains.Utilizing single sensor data for pedestrian detection often leads to low recall rates of de-tection results and lack of category or orientation information in the results,which cannot meet practical field requirements.To address these issues,this paper proposed a pedestrian detection method based on the fusion of 3D point clouds and images in rail transit scenes.This method first employed a deep learning model trained on a constructed dataset of pedestrian data in rail transit scenes to detect pedes-trians separately in 3D point clouds and images.Subsequently,based on the principle of spatial position consistency of the targets in 3D point clouds and images,the rotation and translation matrix between the LiDAR and camera was solved.Finally,the 3D point cloud ob-ject detection results were projected onto the image coordinate system.To solve the issues of misalignment between multiple adjacent tar-gets and the one-to-many relationship between detection results,the intersection over union ratio and center point distance between the two detection results were calculated as fusion constraints,enabling more accurate pedestrian detection.Experimental results using data acquired from the field demonstrate that,compared to detection results from separate data of 3D point clouds and images,while maintain-ing timeliness,this method improves the recall rate by 4.5％and 5.5％,respectively,effectively reducing the risk of safety accidents caused by missed pedestrian detections,meeting the demand for pedestrian detection during actual train operations.

关键词

三维点云/图像检测/深度学习/多传感器融合/目标检测

Key words

3D point clouds/image detection/deep learning/multi-sensor fusion/object detection

引用本文复制引用

出版年

2024

机车电传动

中国南车集团株洲电力机车厂

机车电传动

CSTPCD

影响因子：0.347

ISSN：1000-128X

段落导航