无人系统技术2024,Vol.7Issue(5) :54-64.DOI:10.19942/j.issn.2096-5915.2024.05.48

基于跨模态渐进式融合的无人机目标检测方法

Cross-modal Progressive Fusion Method for UAV Target Detection

高棋 张骢 史瑞 张成 张跃
无人系统技术2024,Vol.7Issue(5) :54-64.DOI:10.19942/j.issn.2096-5915.2024.05.48

基于跨模态渐进式融合的无人机目标检测方法

Cross-modal Progressive Fusion Method for UAV Target Detection

高棋 1张骢 1史瑞 1张成 1张跃1
扫码查看

作者信息

  • 1. 沈阳理工大学装备工程学院,沈阳 110159
  • 折叠

摘要

近年来,无人机在军民两用的工作中面临着日益增长的技术需求,这给地面目标检测的精度带来了重大挑战.针对复杂背景下无人机目标检测算法精度低的问题,提出了一种改进YOLOv8的双光图像融合网络.首先,该网络采用特征级融合策略,充分利用可见光和红外数据,显著提升了在复杂场景下的目标检测性能.其次,为了减少多模态融合可能引入的冗余信息,设计了轻量化特征提取分支.最后,提出了渐进式跨模态融合模块,增强了多模态信息的特征交互,有效解决了不同模态融合时的信息差带来的检测性能受限问题.实验结果表明,与原始算法相比,所提方法拥有更高的精确率和召回率,同时在行人检测数据集LLVIP上AP50和mAP0.5-0.95 分别达到了91.2%、55.5%,FPS值达到了95.综上,与现有算法相比,所提方法在性能上表现更优,未来将继续探索其在不同环境中的应用潜力.

Abstract

In recent years,the increasing technology demand for Unmanned Aerial Vehicles(UAVs)in dual-use operations during day and night has posed significant challenges to the accuracy of ground target detection.To address the low accuracy of UAV target detection algorithms in complex backgrounds,an improved dual-spectrum image fusion network based on YOLOv8 is proposed,called the Progressive Cross-Modal Fusion Network(PCMFNet).Firstly,a feature-level fusion strategy is employed,allowing both visible light and infrared data to be fully leveraged,significantly enhancing target detection performance in complex scenarios.Secondly,a lightweight feature extraction branch is designed to mitigate the potential introduction of redundant information during multi-modal fusion.Finally,a Progressive Cross-modal Fusion Module is proposed,which strengthens the feature interaction of multi-modal information and effectively addresses the decline in detection performance caused by differences between modalities.Experimental results indicate that higher precision and recall are achieved by PCMFNet compared to the original algorithm.Specifically,on the LLVIP dataset,AP50 and mAP0.5-0.95 values of 91.2% and 55.5% are reached,respectively,with a frame-per-second(FPS)rate of 95.In conclusion,PCMFNet demonstrates superior performance compared to existing algorithms,and future work will explore its potential application in various environments.

关键词

无人机/目标检测/YOLOv8/多模态融合/渐进式融合/轻量化/模态差异

Key words

UAV/Target Detection/YOLOv8/Multimodal Fusion/Progressive Fusion/Light-weight/Modal Difference

引用本文复制引用

出版年

2024
无人系统技术

无人系统技术

ISSN:
段落导航相关论文