光电子·激光2024,Vol.35Issue(8) :793-802.DOI:10.16136/j.joel.2024.08.0852

基于改进YOLOv5的超分辨率和多尺度融合目标检测算法

Super-resolution and multi-scale fusion target detection algorithm based on improved YOLOv5

姚珊珊 王静宇 郝斌 张飞 高鹭 任晓颖
光电子·激光2024,Vol.35Issue(8) :793-802.DOI:10.16136/j.joel.2024.08.0852

基于改进YOLOv5的超分辨率和多尺度融合目标检测算法

Super-resolution and multi-scale fusion target detection algorithm based on improved YOLOv5

姚珊珊 1王静宇 1郝斌 1张飞 1高鹭 1任晓颖1
扫码查看

作者信息

  • 1. 内蒙古科技大学信息工程学院,内蒙古包头 014000
  • 折叠

摘要

为了提升目标检测算法在多尺度学习方面的能力,尤其是对小目标的检测能力,本文提出了一种基于改进YOLOv5的超分辨率和多尺度融合目标检测算法.首先,该算法使用子像素卷积代替原YOLOv5模型的上采样操作,提高图像的分辨率,并尽可能保留小目标的信息.其次,使用并行快速多尺度融合(parallel fast multi-scale fusion,PFMF)模块实现深层特征和浅层特征的双向融合,将原YOLOv5算法的3尺度预测升级为4尺度预测,以此提高模型多尺度特征学习能力和对小目标的检测效果.实验结果表明,与YOLOv5s相比,改进后的模型在PASCAL VOC数据集中,mAP@0.5提高了2.8个百分点,mAP@0.5∶0.95提高了3.5个百分点;在MS COCO数据集中,mAP@0.5提高了4.3个百分点,mAP@0.5:0.95提高了5.2个百分点.改进后的YOLOv5模型在多尺度检测,尤其是小目标的检测效果方面得到了提升,并具有一定的应用价值.

Abstract

To enhance the multi-scale learning capacity of target detection algorithms,particularly for small targets,this paper proposes a super-resolution and multi-scale fusion target detection algorithm based on an improved YOLOv5 framework.Firstly,instead of the up-sampling operation of the original YOLOv5 model,the algorithm utilizes sub-pixel convolution to enhance the image resolution and pre-serve the information of small targets to the greatest extent possible.Secondly,the algorithm utilizes the parallel fast multi-scale fusion(PFMF)module to achieve two-way fusion of deep and shallow features.This upgrade from the original YOLOv5 algorithm's 3-scale prediction to 4-scale prediction improves the model's ability to learn multi-scale features and detect small targets.The experimental results demon-strate that compared with YOLOv5s,the improved model achieves a 2.8%and 3.5%increase in mAP@0.5 and mAP@0.5∶0.95,respectively,on the PASCAL VOC dataset.Similarly,on the MS COCO dataset,the improved model achieves a 4.3%and 5.2%increase in mAP@0.5 and mAP@0.5∶0.95,respectively.The experiments demonstrate the improved YOLOv5 model's enhanced capability in multi-scale detection,particularly for small targets,and indicate its potential practical value.

关键词

目标检测/YOLOv5算法/子像素卷积/多尺度融合

Key words

object detection/YOLOv5 algorithm/sub-pixel convolution/multi-scale fusion

引用本文复制引用

基金项目

内蒙古自治区科技计划项目(2021GG0046)

内蒙古自治区科技计划项目(2021GG0048)

出版年

2024
光电子·激光
天津理工大学 中国光学学会

光电子·激光

CSCD北大核心
影响因子:1.437
ISSN:1005-0086
参考文献量20
段落导航相关论文