首页|改进Mask R-CNN的馆藏报纸图像内容分割

改进Mask R-CNN的馆藏报纸图像内容分割

扫码查看
开展馆藏报纸图像内容分割研究,能提升文字识别准确率,对促进机器识别取代人工操作、提高图书馆数字化工作效率具有重要意义.文章根据报纸图像呈现的特征,提出一种基于改进Mask R-CNN的算法,实现报纸图像内容分割.首先,通过优化锚框比例和损失函数,对原始Mask R-CNN算法进行改进.其次,采用数据增强、调整训练参数开展样本训练.最后,通过实验的方式对改进后的Mask R-CNN算法训练模型和原始算法训练模型进行比较,并采用AP_bbox和AP_segm评价指标对实验结果进行评估,改进后的算法训练模型AP_bbox为0.935,AP_segm为0.943,均超过原始算法训练模型.实验结果表明,改进后的Mask R-CNN算法能够实现报纸图像内容有效检测与分割.
Improved Mask R-CNN for Content Segmentation of Archival Newspaper Images in Library Collection
The study of content segmentation of newspaper images in the library collections is crucial for enhancing the accuracy of text recognition,which is of great importance for advancing machine recognition to replace manual operation,and improving the efficiency of digitization in libraries.This paper proposes an algorithm based on an improved Mask R-CNN to separate the content of newspaper images.First,the original Mask R-CNN is improved by optimizing the anchor box ratios and loss functions.Secondly,samples are trained by data augmentation and adjusted training parameters.Finally,the training model based on the improved Mask R-CNN is compared with the original model through experiments,and the experimental results are evaluated using the AP_bbox and AP_segm evaluation indicators.The improved algorithm training model scored 0.935 in AP_bbox and 0.943 in AP_segm,outperforming the original training model in both categories.This study suggests that the improved Mask R-CNN algorithm can achieve effective detection and segmentation of newspaper image content.

Mask R-CNNdigitization of newspapercontent segmentationtarget detection

倪劼、叶江松、谢恩泽

展开 >

南京图书馆

Mask R-CNN 报纸数字化 内容分割 目标检测

江苏省图书馆学会研究课题

22YB067

2024

图书馆论坛
广东省立中山图书馆

图书馆论坛

CSTPCDCSSCICHSSCD北大核心
影响因子:1.864
ISSN:1002-1167
年,卷(期):2024.44(6)
  • 7