改进Mask R-CNN的馆藏报纸图像内容分割

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：开展馆藏报纸图像内容分割研究,能提升文字识别准确率,对促进机器识别取代人工操作、提高图书馆数字化工作效率具有重要意义.文章根据报纸图像呈现的特征,提出一种基于改进Mask R-CNN的算法,实现报纸图像内容分割.首先,通过优化锚框比例和损失函数,对原始Mask R-CNN算法进行改进.其次,采用数据增强、调整训练参数开展样本训练.最后,通过实验的方式对改进后的Mask R-CNN算法训练模型和原始算法训练模型进行比较,并采用AP_bbox和AP_segm评价指标对实验结果进行评估,改进后的算法训练模型AP_bbox为0.935,AP_segm为0.943,均超过原始算法训练模型.实验结果表明,改进后的Mask R-CNN算法能够实现报纸图像内容有效检测与分割.

外文标题：Improved Mask R-CNN for Content Segmentation of Archival Newspaper Images in Library Collection

外文摘要：The study of content segmentation of newspaper images in the library collections is crucial for enhancing the accuracy of text recognition,which is of great importance for advancing machine recognition to replace manual operation,and improving the efficiency of digitization in libraries.This paper proposes an algorithm based on an improved Mask R-CNN to separate the content of newspaper images.First,the original Mask R-CNN is improved by optimizing the anchor box ratios and loss functions.Secondly,samples are trained by data augmentation and adjusted training parameters.Finally,the training model based on the improved Mask R-CNN is compared with the original model through experiments,and the experimental results are evaluated using the AP_bbox and AP_segm evaluation indicators.The improved algorithm training model scored 0.935 in AP_bbox and 0.943 in AP_segm,outperforming the original training model in both categories.This study suggests that the improved Mask R-CNN algorithm can achieve effective detection and segmentation of newspaper image content.

外文关键词：

Mask R-CNNdigitization of newspapercontent segmentationtarget detection

作者：

倪劼、叶江松、谢恩泽

展开 >

作者单位：

南京图书馆

关键词：

Mask R-CNN 报纸数字化内容分割目标检测

基金：

江苏省图书馆学会研究课题

项目编号：

22YB067

出版年：

2024

图书馆论坛

广东省立中山图书馆

图书馆论坛

CSTPCDCSSCICHSSCD北大核心

影响因子：1.864

ISSN：1002-1167

年,卷(期)：2024.44(6)

参考文献量7