计算机工程与设计2024,Vol.45Issue(10) :3017-3025.DOI:10.16208/j.issn1000-7024.2024.10.018

多模态特征融合的行人检测算法

Pedestrian detection algorithm based on multimodal feature fusion

陈舒静 蒙祖强
计算机工程与设计2024,Vol.45Issue(10) :3017-3025.DOI:10.16208/j.issn1000-7024.2024.10.018

多模态特征融合的行人检测算法

Pedestrian detection algorithm based on multimodal feature fusion

陈舒静 1蒙祖强1
扫码查看

作者信息

  • 1. 广西大学计算机与电子信息学院,广西南宁 530004
  • 折叠

摘要

针对红外图像清晰度和分辨率较低,可见光图像光照不足等问题,通过融合可见光和红外光图像的特征,设计一种基于YOLOv5改进的多模态行人检测算法IMV5(improved multimodal YOLOv5).对传统的级联融合方法进行改进,结合注意力机制,设计一种多模态特征融合模块PMWM(pedestrian modal adaptive weight fusion module),将可见光和红外光图像融合,提高特征融合后的检测效果.加入优化过的空间金字塔池化结构,在保持感受野不变的情况下提升检测效果.在特征层上进行目标检测,预测出行人的概率和位置,实现行人检测功能.实验结果表明,IMV5算法在KAIST行人检测据集上的检测效果得到了明显提升.

Abstract

Aiming at the problems of low clarity and resolution of infrared images and insufficient illumination of visible images,an improved multi-modal pedestrian detection algorithm IMV5(improved multimodal YOLOv5)based on YOLOv5 was designed by fusing the characteristics of visible and infrared images.The traditional cascade fusion method was improved,and combined with the attention mechanism,a multi-modal feature fusion module PMWM(pedestrian modal adaptive weight fusion module)was designed to fuse visible and infrared images to improve the detection effect after feature fusion.The optimized spatial pyra-mid pooling structure was added to improve the detection effect while keeping the receptive field unchanged.The target detection was performed on the feature layer to predict the probability and location of pedestrians and realize the pedestrian detection func-tion.Experimental results show that the detection effect of IMV5 algorithm on KAIST pedestrian detection dataset is significantly improved.

关键词

多模态/YOLOv5/行人检测算法/特征融合/注意力机制/空间金字塔/目标检测

Key words

multi-modal/YOLOv5/pedestrian detection algorithm/feature fusion/attention mechanism/spatial pyramid/object detection

引用本文复制引用

基金项目

国家自然科学基金项目(62266004)

出版年

2024
计算机工程与设计
中国航天科工集团二院706所

计算机工程与设计

CSTPCD北大核心
影响因子:0.617
ISSN:1000-7024
段落导航相关论文