河北工业大学学报2024,Vol.53Issue(4) :31-39,50.DOI:10.14081/j.cnki.hgdxb.2024.04.003

融合全局特征的道路场景目标检测方法

Object detection method in road scene based on fusion of global features

王倩 马杰 赵月华 叶茂 武麟
河北工业大学学报2024,Vol.53Issue(4) :31-39,50.DOI:10.14081/j.cnki.hgdxb.2024.04.003

融合全局特征的道路场景目标检测方法

Object detection method in road scene based on fusion of global features

王倩 1马杰 1赵月华 1叶茂 1武麟1
扫码查看

作者信息

  • 1. 河北工业大学 电子信息工程学院,天津 300401
  • 折叠

摘要

复杂交通环境下目标检测中存在很多外界干扰因素,导致通用的目标检测算法效果较差.针对目标检测方法中全局特征信息利用不充分,小目标、遮挡目标检测精度低,以及模型计算量大等问题,提出一种基于改进YOLOv5s的融合全局特征目标检测方法.首先,对YOLOv5s的主干网络进行扩展,得到更深层的特征图以增强较大目标的语义信息;其次,在此基础上引入全局信息融合模块代替原模型中的Neck部分,以3D卷积的方式融合各尺度信息;然后,设计了一种基于位置的先验框匹配方法,在原图尺度上搜索与真实框匹配的先验框;最后,使用Copy-Paste数据增强方法增大小目标样本数量并使用DIoU-NMS作为后处理方法进行非极大值抑制.该模型在BDD100K数据集中平均精确率(mean Average Preci-sion,mAP)为54.55%,检测速度为63.72帧每秒(Frames Per Second,FPS).与原始YOLOv5s算法相比,该方法在检测速度及精度方面均有明显优势.

Abstract

There are many external interference factors in object detection in complex traffic environment,which lead to poor detection effect of some general algorithms.A fusion global feature object detection method based on improved YO-LOv5s is proposed to address the insufficient utilization of global information,low accuracy of small object and occlusion object detection,and the large amount of calculation.Firstly,the backbone of YOLOv5s is expanded to obtain deeper fea-ture maps to enhance the semantic information of large objects.Secondly,a global information fusion module is intro-duced to replace the Neck part,and the information of each scale is fused by 3D convolution.Then,a location-based pri-or box matching method is designed,which searches for the prior box matching the bounding box on the original image scale.Finally,the Copy-Paste data augmentation method is applied to increase the number of small target samples,and DIoU-NMS is used as the post-processing method.The mean Average Precision(mAP)of the model in BDD100K dataset is 54.55%,the detection speed is 63.72 Frames Per Second(FPS).Compared with the YOLOv5s algorithm,this method has significant advantages in terms of detection speed and accuracy.

关键词

道路场景/目标检测/特征融合/先验框匹配/数据增强

Key words

road scene/object detection/feature fusion/prior box matching/data augmentation

引用本文复制引用

基金项目

河北省自然科学基金资助项目(F2020202045)

河北省研究生创新项目(CXZZBS2020026)

天津市教委科研计划项目(2018KJ268)

出版年

2024
河北工业大学学报
河北工业大学

河北工业大学学报

CSTPCD
影响因子:0.344
ISSN:1007-2373
段落导航相关论文