首页|融合场景先验的船名文本检测方法

融合场景先验的船名文本检测方法

扫码查看
目的 船名文本信息是船舶身份识别的核心要素.真实场景船舶影像中文本区域尺度不一导致船名文本检测存在漏检等问题.同时,现有自然场景文本检测算法难以排除背景文本、图案等因素对船名检测任务的干扰.因此,针对以上问题提出一种融合场景先验的船名检测方法.方法 首先,依据船首与船名目标关联性,提出一个基于先验损失的区域监督模块,以约束模型关注船名文本区域特征.然后,为了提高文本区域细粒度,提出一个基于非对称卷积的船名区域定位模块,增强文本区域边缘信息,进一步提高船名检测的召回率.结果 本文收集、标注并公开发布了一个真实场景船名文本检测数据集CBWLZ2023进行实验验证,并与最新的8种通用自然场景文本检测方法进行比较.本文算法在船名文本检测任务上取得了 94.2%的F1值,相比于性能第2的模型,F1值提高了 2.3%;相比于基线模型,F1值提高了 2.8%.同时在CBWLZ2023数据集中进行了参数分析实验及消融实验以验证算法各模块的有效性.实验结果证明提出的算法能准确获取边界清晰的文本区域,改善了船名文本检测的效果.结论 本文提出的融合场景先验的船名检测模型,可以解决船名文本尺度不一、背景文本干扰带来的问题,在检测精度上超过了现有的场景文本检测算法,具有有效性与先进性.CBWLZ2023可由https://aistudio.baidu.com/aistudio/datasetdetail/224137 获取.
Ship name text detection method with scene priors fusion
Objective Ships are the most important carriers of waterborne transportation,accounting for over two-thirds of global trade in goods transportation.Ship names,as one of the most crucial identification pieces of information for ships,possess uniqueness and distinctiveness,forming the core elements for intelligent ship identity recognition.Achieving ship name text detection is crucial in enhancing waterway traffic regulation and improving maritime transport safety.However,in real-world scenarios,given the variations in ship size and diverse ship types,the areas of ship name text regions differ,and the aspect ratio of ship name text varies greatly across different ship types,directly affecting the accuracy of ship name text detection and increasing the likelihood of missed detections.Additionally,during ship name text detection,various elements,such as background text and patterns in the scene,can introduce interference.Existing natural scene text detec-tion algorithms do not completely eliminate these interference factors.Directly applying them to ship name text detection tasks may lead to poor algorithm robustness.Therefore,this study addresses the aforementioned issues and proposes a ship name detection method based on scene prior information.Method First,given that ship name text regions are usually fixed at the bow and two sides of the ship,this study proposes a region supervision module based on prior loss,which utilizes the correlation between the bow and the ship name text target.Through the classification and regression branches on the shared feature maps,prior information of the bow region is obtained,constructing a scene prior loss with bow correlation.During training,the model simultaneously learns the ship name text detection main task and the bow object detection auxiliary task and updates the network parameters through joint losses to constrain the model's attention to the ship name text region fea-tures and eliminate background interference.Then,a ship name region localization module based on asymmetric convolu-tion is further proposed to improve the granularity of text region localization.It achieves lateral connections between deep semantic information and shallow localization information by fusing feature layers with different scales between networks.On the basis of the additive property of convolution,three convolution kernels with sizes of 3 × 3,3 × 1,and 1×3 are used to enhance the fused feature maps,balancing the weights of the kernel region features to enrich the text edge information.Finally,a differentiable binarization optimization is introduced to generate text boundaries and realize ship name text region localization.Given that no ship name text detection dataset is publicly available,this study constructs the CBWLZ2023 dataset,comprising 1 659 images of various types of ships,such as fishing vessels,passenger ships,cargo ships,and war-ships,captured in real-world scenes such as waterways and ports,featuring differences in background,ship poses,light-ing,text attributes,and character sizes.Result To validate the effectiveness of the proposed algorithm,this study col-lected,annotated,and publicly released a real-world ship name text detection dataset CBWLZ2023 for experimental verifi-cation and compared it with eight state-of-the-art general natural scene text detection methods.Quantitative analysis results show that the proposed algorithm achieves an F-value of 94.2%in the ship name text detection task,representing a 2.3%improvement over the second-best-performing model.Moreover,ablation experiments demonstrate that the model's F-value increases by 2.3%and 0.7%after incorporating the region supervision module based on prior loss and the ship name region localization module based on asymmetric convolution,respectively.The fused model's F-value increases by 2.8%,confirming the effectiveness of each algorithm module.Qualitative analysis results indicate that the proposed algorithm exhibits stronger robustness than other methods in dealing with text of varying scales and background interference,accu-rately capturing text regions with clear boundaries and effectively reducing false positives and missed detections.Experi-mental results demonstrate that the proposed algorithm enhances ship name text detection performance.Conclusion This study proposes a ship name detection method based on scene prior information.The algorithm has two main advantages.First,it fully utilizes the strong correlation between the bow region of the ship and the ship name text region,suppressing the interference of background information in ship name detection tasks.Second,it integrates multiscale text feature infor-mation to enhance the robustness of multiscale text object detection.The proposed algorithm achieves higher detection accuracy than existing scene text detection algorithms on the CBWLZ2023 dataset,demonstrating its effectiveness and advancement.The CBWLZ2023 can be obtained from https://aistudio.baidu.com/aistudio/datasetdetail/224137.

ship name text detectionscene priori lossregional supervisionfeature enhancementasymmetric convolu-tion

陈博伟、易尧华、汤梓伟、彭继兵、尹爱国

展开 >

武汉大学遥感信息工程学院,武汉 430079

自然资源部西南山地自然资源遥感监测工程技术创新中心,成都 610000

武汉大学数字成像与智能感知研究中心,武汉 430079

珠海奔图电子有限公司,珠海 519060

展开 >

船名文本检测 场景先验损失 区域监督 特征增强 非对称卷积

国家重点研发计划资助自然资源部西南山地自然资源遥感监测工程技术创新中心开放课题基金项目

2021YFB2206200RSMNRSCM-2024-001

2024

中国图象图形学报
中国科学院遥感应用研究所,中国图象图形学学会 ,北京应用物理与计算数学研究所

中国图象图形学报

CSTPCD北大核心
影响因子:1.111
ISSN:1006-8961
年,卷(期):2024.29(10)