首页|基于全局上下文信息的遥感图像小目标检测

基于全局上下文信息的遥感图像小目标检测

扫码查看
针对光学遥感图像通常具有背景复杂、尺度多样化、目标普遍较小且方向各异等特点导致的目标检测精度低的问题,提出了一种基于多尺度信息融合的遥感小目标检测方法.首先设计了GCC3模块,该模块显著增强了模型区分目标与背景的能力,有效地提升了特征提取的质量,并显著提高了对小尺度目标的检测精度.为了实现对不同尺度特征信息的有效融合,提升多尺度目标检测的性能,引入了BiFPN(bi-directional feature pyramid network)结构,以替代YOLOv5中原有的PANet结构.此外,为了应对目标方向的不确定性以及缓解角度回归导致的边界不连续性问题,采用了圆形平滑标签方法,以实现对目标区域的精确定位.实验结果表明,所提方法在小目标检测方面具有明显优势,相比改进前的算法,其检测精度提高了4.9%,有效地提升了光学遥感图像检测的准确性.
Small Target Detection in Remote Sensing Images Based on Global Context Information
Objective Accurate identification and positioning of small targets(such as vehicles,buildings,and vegetation)in large-scale remote sensing images are crucial for military reconnaissance,urban planning,environmental monitoring,and other fields.However,traditional target detection methods often struggle to accurately identify these targets due to their small size,irregular shape,complex background,and illumination changes in the image.Therefore,there is a critical need for specialized research on small target detection.Research in this area can enhance the accuracy and efficiency of remote sensing image analysis,providing more reliable data support for decision-making and planning across various fields.This research holds significant theoretical and practical value.Methods Optical remote sensing images may suffer from low target detection accuracy due to complex backgrounds,varied scales,generally small targets,and different orientations.We propose a method for remote sensing small target detection based on multi-scale information fusion.Key improvements include:1)C3 module integration:designed to integrate a global context module,enhancing the model's ability to distinguish targets from backgrounds.This ensures the model focuses on key areas while ignoring unnecessary ones,which effectively improves target localization accuracy.2)Optimized PANet with BiFPN:to balance feature information across different scales and strengthen multi-scale target detection performance,we optimize the PANet and introduce the BiFPN.This feature pyramid network structure better utilizes multi-level feature information for accurate detection of targets of various sizes.3)Circular smooth label method:addressing the challenge of targets at different directions and angles,this method transforms the true rotation angle of target objects into a continuous probability distribution.This approach converts the angle regression problem into a classification problem,thereby improving detection and positioning accuracy.4)Image slicing preprocessing:to enable rapid detection of high-resolution images,we adopt an image slicing preprocessing method,which segments large images into smaller blocks for processing,significantly reducing false detection and missed detection of small targets.Results and Discussions To thoroughly validate the effectiveness of the proposed algorithm,we conduct a series of module ablation experiments on the DOT A dataset,with the experimental results detailed in Table 1.Based on the data shown in Table 1,our study successfully enhances the model's feature extraction capabilities,which strengthens its accuracy in locating target areas and achieves an algorithmic mAP of 83.7%.To further assess the performance of the improved algorithm,we make comparisons with advanced target detection algorithms such as R2CNN,YOLOv3,SCRDet,YOLOv5s,YOLOv6s,MaskOBB and YOLOv7 using the DOT A dataset.The experimental findings are summarized in Table 2.The analysis of these results demonstrates that the algorithm proposed in this study outperforms other comparison algorithms in terms of accuracy.To comprehensively evaluate the performance of the GCB-YOLOv5 algorithm,we employ the same remote sensing dataset for verification,comparing its detection rates with those of the original YOLOv5 algorithm and other algorithms in the YOLO series.The findings are presented in Table 3.Conclusions In the face of challenges such as diverse target scales,complex backgrounds,the prevalence of small targets,and diverse target orientations in optical remote sensing images,we first introduce the GCC3 module designed to enhance the model's ability to distinguish between targets and backgrounds.This enhancement directs the model's focus towards key areas while disregarding unnecessary ones,thereby significantly improving the detection accuracy of small-scale targets.Additionally,our study replaces the PANet structure with BiFPN to better address the detection requirements of multi-scale targets.The incorporation of circular smooth labeling effectively manages the multi-scale and directional uncertainties of the targets.The experimental results strongly support the significant advantages of the proposed algorithm in small-scale target detection.In future research,the model will be optimized for lightweight performance to balance the reasoning speed and detection accuracy,thereby enhancing its applicability in practical scenarios.

remote sensing imagedeep learningtarget detectionmultiscale

李红岩、徐保庆、张子扬、王伟峰

展开 >

西安科技大学电气与控制工程学院,陕西西安 710054

西安市电气设备状态监测与供电安全重点实验室,陕西西安 710054

遥感图像 深度学习 目标检测 多尺度

2024

光学学报
中国光学学会 中国科学院上海光学精密机械研究所

光学学报

CSTPCD北大核心
影响因子:1.931
ISSN:0253-2239
年,卷(期):2024.44(24)