光电子·激光2024,Vol.35Issue(4) :344-350.DOI:10.16136/j.joel.2024.04.0636

融合多尺度信息和跨维特征引导的轻量行人检测

Lightweight pedestrian detection based on multi-scale information and cross-dimensional feature guidance

张云佐 李文博 郭威
光电子·激光2024,Vol.35Issue(4) :344-350.DOI:10.16136/j.joel.2024.04.0636

融合多尺度信息和跨维特征引导的轻量行人检测

Lightweight pedestrian detection based on multi-scale information and cross-dimensional feature guidance

张云佐 1李文博 2郭威2
扫码查看

作者信息

  • 1. 石家庄铁道大学信息科学与技术学院,河北石家庄 050043;河北省电磁环境效应与信息处理重点实验室,河北石家庄 050043
  • 2. 石家庄铁道大学信息科学与技术学院,河北石家庄 050043
  • 折叠

摘要

针对复杂道路场景下行人检测精度与速度难以提升的问题,提出一种融合多尺度信息和跨维特征引导的轻量级行人检测算法.首先以高性能检测器YOLOX为基础框架,构建多尺度轻量卷积并嵌入主干网络中,以获取多尺度特征信息.然后设计了一种端到端的轻量特征引导注意力模块,采用跨维通道加权的方式将空间信息与通道信息融合,引导模型关注行人的可视区域.最后为减少模型在轻量化过程中特征信息的损失,使用增大感受野的深度可分离卷积构建特征融合网络.实验结果表明,相比于其他主流检测算法,所提算法在KITTI数据集上达到了71.03%的检测精度和80 FPS的检测速度,在背景复杂、密集遮挡、尺度不一等场景中都具有较好的鲁棒性和实时性.

Abstract

Aiming at the detection accuracy and speed of pedestrian detection in complex road environ-ment,a lightweight pedestrian detection algorithm based on multi-scale information and cross-dimension-al feature guidance is proposed.Firstly,based on the high-performance detector YOLOX,a multi-scale lightweight convolution is constructed and embedded in the backbone network to obtain multi-scale fea-ture information.Secondly,an end-to-end lightweight feature guided attention module is designed,which guides the model to focus on the visible region of pedestrian targets by fusing spatial information and re-lated information through cross-dimensional channel weighting mehod.Finally,in order to reduce the loss of feature information in the process of lightweight of the model,a feature fusion network is con-structed by depthwise separable convolution with increasing the depth of the receptive field.The experi-mental results show that compared with other mainstream detection algorithms,the proposed algorithm on the KITTI dataset reaches 71.03%detection accuracy and 80 FPS detection speed,which has better robustness and real-time performance in scenes with complex background,dense occlusion and different scales.

关键词

行人检测/多尺度/跨维特征引导/特征融合/轻量化模型

Key words

pedestrian detection/multi-scale/cross-dimensional feature guidance/feature fusion/light-weight model

引用本文复制引用

基金项目

国家自然科学基金(61702347)

国家自然科学基金(62027801)

河北省自然科学基金(F2022210007)

河北省自然科学基金(F2017210161)

河北省高等学校科学技术研究项目(ZD2022100)

河北省高等学校科学技术研究项目(QN2017132)

中央引导地方科技发展资金项目(226Z0501G)

出版年

2024
光电子·激光
天津理工大学 中国光学学会

光电子·激光

CSCD北大核心
影响因子:1.437
ISSN:1005-0086
参考文献量20
段落导航相关论文