基于图像低维特征融合的航拍小目标检测模型

扫码查看

原文链接

万方数据
维普

中文摘要：针对无人机航拍图像目标检测中视野变化大、时空信息复杂等问题,文中基于YOLOv5(You Only Look Once Version5)架构,提出基于图像低维特征融合的航拍小目标检测模型.引入CA(Coordinate Attention),改进Mo-bileNetV3的反转残差块,增加图像空间维度信息的同时降低模型参数量.改进YOLOv5特征金字塔网络结构,融合浅层网络中的特征图,增加模型对图像低维有效信息的表达能力,进而提升小目标检测精度.同时为了降低航拍图像中复杂背景带来的干扰,引入无参平均注意力模块,同时关注图像的空间注意力与通道注意力;引入VariFocal Loss,降低负样本在训练过程中的权重占比.在VisDrone数据集上的实验验证文中模型的有效性,该模型在有效提升检测精度的同时明显降低复杂度.

外文标题：Model for Small Object Detection in Aerial Photography Based on Low Dimensional Image Feature Fusion

外文摘要：To address the challenges of significant changes in the field of view and complex spatiotemporal information in unmanned aerial vehicle aerial image target detection,a model for small object detection in aerial photography based on low dimensional image feature fusion is presented grounded on the YOLOv5(you only look once version 5)architecture.Coordinate attention is introduced to improve the inverted residuals of MobileNetV3,thereby increasing the spatial dimension information of images while reducing parameters of the model.The YOLOv5 feature pyramid network structure is improved to incorporate feature images from shallow networks.The ability of the model to represent low-dimensional effective information of images is enhanced,and consequently the detection accuracy of the proposed model for small objects is improved.To reduce the impact of complex background in the image,the parameter-free average attention module is introduced to focus on both spatial attention and channel attention.VariFocal Loss is adopted to reduce the weight proportion of negative samples in the training process.Experiments on VisDrone dataset demonstrate the effectiveness of the proposed model.The detection accuracy is effectively improved while the model complexity is significantly reduced.

外文关键词：

You Only Look Once Version 5(YOLOv5)Small Target DetectionAttention Mecha-nismLoss Function

作者：

蔡逢煌、张家翔、黄捷

展开 >

作者单位：

福州大学电气工程与自动化学院福州 350108

关键词：

You Only Look Once Version5(YOLOv5) 小目标检测注意力机制损失函数

基金：

国家自然科学基金国家自然科学基金青年科学基金

项目编号：

9236710962301163

出版年：

2024

DOI：

10.16451/j.cnki.issn1003-6059.202402005

模式识别与人工智能

中国自动化学会,国家智能计算机研究开发中心,中国科学院合肥智能机械研究所

模式识别与人工智能

CSTPCD北大核心

影响因子：0.954

ISSN：1003-6059

年,卷(期)：2024.37(2)

参考文献量31