基于改进YOLOv5的超分辨率和多尺度融合目标检测算法

Super-resolution and multi-scale fusion target detection algorithm based on improved YOLOv5

姚珊珊 ¹王静宇 ¹郝斌 ¹张飞 ¹高鹭 ¹任晓颖¹

扫码查看

作者信息

1. 内蒙古科技大学信息工程学院,内蒙古包头 014000
折叠

摘要

为了提升目标检测算法在多尺度学习方面的能力,尤其是对小目标的检测能力,本文提出了一种基于改进YOLOv5的超分辨率和多尺度融合目标检测算法.首先,该算法使用子像素卷积代替原YOLOv5模型的上采样操作,提高图像的分辨率,并尽可能保留小目标的信息.其次,使用并行快速多尺度融合(parallel fast multi-scale fusion,PFMF)模块实现深层特征和浅层特征的双向融合,将原YOLOv5算法的3尺度预测升级为4尺度预测,以此提高模型多尺度特征学习能力和对小目标的检测效果.实验结果表明,与YOLOv5s相比,改进后的模型在PASCAL VOC数据集中,mAP@0.5提高了2.8个百分点,mAP@0.5∶0.95提高了3.5个百分点;在MS COCO数据集中,mAP@0.5提高了4.3个百分点,mAP@0.5:0.95提高了5.2个百分点.改进后的YOLOv5模型在多尺度检测,尤其是小目标的检测效果方面得到了提升,并具有一定的应用价值.

Abstract

To enhance the multi-scale learning capacity of target detection algorithms,particularly for small targets,this paper proposes a super-resolution and multi-scale fusion target detection algorithm based on an improved YOLOv5 framework.Firstly,instead of the up-sampling operation of the original YOLOv5 model,the algorithm utilizes sub-pixel convolution to enhance the image resolution and pre-serve the information of small targets to the greatest extent possible.Secondly,the algorithm utilizes the parallel fast multi-scale fusion(PFMF)module to achieve two-way fusion of deep and shallow features.This upgrade from the original YOLOv5 algorithm's 3-scale prediction to 4-scale prediction improves the model's ability to learn multi-scale features and detect small targets.The experimental results demon-strate that compared with YOLOv5s,the improved model achieves a 2.8％and 3.5％increase in mAP@0.5 and mAP@0.5∶0.95,respectively,on the PASCAL VOC dataset.Similarly,on the MS COCO dataset,the improved model achieves a 4.3％and 5.2％increase in mAP@0.5 and mAP@0.5∶0.95,respectively.The experiments demonstrate the improved YOLOv5 model's enhanced capability in multi-scale detection,particularly for small targets,and indicate its potential practical value.

关键词

目标检测/YOLOv5算法/子像素卷积/多尺度融合

Key words

object detection/YOLOv5 algorithm/sub-pixel convolution/multi-scale fusion

引用本文复制引用

基金项目

内蒙古自治区科技计划项目(2021GG0046)

内蒙古自治区科技计划项目(2021GG0048)

出版年

2024

光电子·激光

天津理工大学中国光学学会

光电子·激光

CSCD北大核心

影响因子：1.437

ISSN：1005-0086

参考文献量20

段落导航