基于高效聚合特征增强网络的合成孔径雷达船舰检测方法

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：合成孔径雷达(SAR)由于散射效应以及波长和天线尺寸的分辨率限制,难以获取小尺寸目标的细节和边界信息,因此,检测准确性不高.为了提高SAR船舰检测的准确率以及降低误检率,提出了一种基于高效聚合特征增强网络的SAR船舰检测方法.首先,在主干网络中采用空间通道注意力机制,构建出高效层卷积块作为主要的特征提取模块,以增强模型的特征获取性能,提高模型对船舰目标的识别能力;其次,特征融合部分采用Inception NeXt模块来提高算法效率;最后,在主干网络以及特征提取部分之间构建出一种全局增强特征金字塔分支结构,进一步融合全局特征,避免传输过程中的低维度特征损失,以提升网络的特征融合能力,使其即使对于复杂背景下的小目标仍然能展现出可靠的检测能力.为了证明所提网络的有效性,在SSDD数据集上作了对比实验,实验结果表明,相较于YOLOv7,所提网络的准确率提升了 2.5个百分点,召回率提升了 9.2个百分点,交并比(IoU)阈值为0.5时的平均精度提升了 6.4个百分点,IoU为0.5:0.95时的平均精度提升了 9.9个百分点.实验结果证明,所提网络在提升SAR船舰检测精度、改善误检漏检等方面有显著优势,可作为高精度的检测方法来有效应对SAR船舰检测中存在的问题.

外文标题：Synthetic Aperture Radar Ship Detection Method Based on Highly Efficient Aggregated Feature Enhancement Network

外文摘要：Objective Synthetic aperture radar(SAR)is a microwave imaging radar that utilizes the principle of synthetic aperture to achieve high resolution.It has various characteristics such as all day,all weather,high resolution,and wide bandwidth.It is not affected by weather,day,and night and can obtain high-quality,high-resolution,large-scale,and long-distance images.SAR ship target detection technology can provide important technical support in industries such as ocean,oil,port management,marine resource development,and marine scientific research,as it can detect ships and equipments on the sea and detect potential safety risks in advance.At the same time,ship target detection technology has important strategic significance for strengthening maritime monitoring,border patrol,maritime rescue,and safety assurance of maritime channels.We aim to improve the accuracy of SAR ship detection,reduce false positives,and enhance the adaptability of the model.Methods Traditional SAR image target detection methods include texture analysis,polarization characteristics,and constant false alarm rate(CFAR)algorithms.Among them,the most widely used is the CFAR detection algorithm,which has certain advantages in speed,but its drawbacks are high computational complexity and susceptibility to complex backgrounds,resulting in unsatisfactory detection efficiency.In the actual SAR imaging process,the backgrounds of SAR images are mostly ports,islands,reefs,and other buildings.These backgrounds have high grayscale characteristics and strong confusion.Therefore,for the detection of ship targets on the sea,multiple complex backgrounds,various irregular arrangements of ships,similar target misdetection,and other uncertain factors should be considered.The target features of uncertain factors have a certain degree of similarity with ships.Therefore,we propose an efficient aggregation feature enhancement network(EAFENet)to solve the problems of low accuracy,serious false detections,and unstable effects in current SAR ship target detection.The core idea is to efficiently aggregate stacking modules and introduce residual structures to effectively transmit gradient and feature information and alleviate the problem of gradient vanishing and feature loss.The combination of the CBS(convolution+batch normalization+SiLU)module,CBAM(channel spatial attention mechanism),and leaky ReLU activation function increases the sensitivity of the network to target features and introduces low-dimensional feature fusion.Through multi-layer feature pyramid connections,the expression of features is further extended and enhanced,and the residual idea is used for skip connections,enhancing the learning ability and generalization of the model.Results and Discussions In this article,qualitative and quantitative experiments are conducted on EAFENet and other mainstream models for detecting SAR ships,as well as ablation experimental analysis.To demonstrate the effectiveness of each improvement point in this article,the YOLOv7 network model is used as a benchmark,and six sets of experiments are conducted on the SSDD dataset,with the same environment and parameters.The detected images include multiple targets,few targets,and complex backgrounds.As shown in Table 3,the effect is not ideal only when attention is used alone,and the effect is significantly improved when the mentioned EL-CB(efficient layer convolutional block)is used.The proposed global enhanced feature pyramid network branch structure is used to improve the performance of the feature pyramid and enhance the fusion of shallow features.The accuracy is improved by nearly three percentage points;the recall rate and mAP0.50:0.95 are both improved by nearly 10 percentage points,and mAP0.5 is improved by 6.4 percentage points,proving the effectiveness of each module in this article.In order to further compare the performance of the proposed model,the improved algorithm is used for comparative experiments with the current mainstream algorithms.The experimental environment is the same,and the same training and testing sets are used.The indicators of Faster R-CNN,SSD,YOLOv5,YOLOv7,CenterNet,and our algorithm are shown in Table 4.In terms of accuracy,the EAFENet model is more prominent than other mainstream algorithms.EAFENet performs the best with an accuracy of 95.40％,followed by YOLOv5 and YOLOv7,with an accuracy of 93.32％and 92.90％,respectively.The accuracy of SSD and Faster R-CNN is 84.10％and 82.70％,respectively.Compared with other algorithms,EAFENet uses a more efficient feature extraction module,which to some extent reduces misjudgment.However,mainstream algorithms such as SSD have relatively weak designs in feature extraction and other aspects,as well as a lack of deeper fusion of shallow features in the feature fusion process,resulting in relatively inaccurate prediction results.Therefore,when considering the mAP value,EAFENet still performs the best,reaching 98.90％of mAP0.5,followed by YOLOv5 and YOLOv7,reaching 94.25％and 92.50％,respectively.The mAP0.5 of SSD and Faster R-CNN is 86.01％and 89.17％,respectively.However,the proposed algorithm has undergone deeper fusion in the network structure,resulting in a slight decrease in FPS(frame per second).Overall,compared with other classic algorithms,the proposed algorithm still has significant advantages in speed,and the greatly reduced false detection rate can meet the basic needs of real-time detection.Conclusions In response to the problems of low accuracy and high false detection rate in SAR ship detection,we propose a SAR ship detection method based on an EAFENet.An EL-CB is constructed through spatial channel attention as the feature extraction module of the backbone network,and Inception NeXt is used as the feature extraction part of neck to improve algorithm efficiency,enabling the network model to better understand multi-scale information with detail perception ability.In the network structure,a global enhanced feature pyramid branch structure is constructed by fusing deep-level features with low-level features.This enables the feature extraction network to simultaneously consider both low-level and deep-level information,effectively enhancing the ability to obtain features and ensuring better stability for ship detection in complex backgrounds.The experimental results show that compared with various current detection algorithms,the proposed algorithm has higher detection accuracy and can meet the needs of real-time detection.In future research,the network structure will be further optimized to improve detection accuracy and efficiency.

外文关键词：

deep learningtarget detectionhighly efficient aggregated feature enhancement networkattention mechanismsynthetic aperture radar ship detection

作者：

单慧琳、刘文星、王兴涛、付相为、李长帅、张银胜

展开 >

作者单位：

南京信息工程大学电子与信息工程学院,江苏南京 210044

无锡学院电子信息工程学院,江苏无锡 214105

关键词：

深度学习目标检测高效聚合特征增强网络注意力机制合成孔径雷达船舰检测

基金：

国家自然科学基金国家自然科学基金江苏省一流本科课程江苏省产教融合型一流课程

项目编号：

62071240621061112021YLKC0052022-133

出版年：

2024

DOI：

10.3788/AOS231285

光学学报

中国光学学会　中国科学院上海光学精密机械研究所

光学学报

CSTPCD北大核心

影响因子：1.931

ISSN：0253-2239

年,卷(期)：2024.44(12)