联合度量指标损失和U-Net的文档图像二值化

Document image binarization via combining measure index loss and U-Net

张晶 ¹周稻祥 ²吴永飞 ²冯姝³

扫码查看

作者信息

1. 太原理工大学数学学院,山西晋中 030600
2. 太原理工大学大数据学院,山西晋中 030600
3. 山西农业大学基础部,山西太谷 030801
折叠

摘要

当前深度神经网络模型在图像分割时均采用交叉熵做训练损失函数,当损失值变小时评价指标不一定变得更优.为解决上述缺陷,提出一种基于度量指标损失的U-Net网络模型.由于错误接受率和错误拒绝率变小时度量指标F-Mea-sure会上升,因此构建半错误率损失函数.采用分治策略,将文档图像分割成固定大小的图像块,分别进行二值化.在文档图像竞赛数据集上进行大量对比实验,实验结果表明,该方法相比原始U-Net,在4个度量指标上均有提升,二值化结果图像的文字连通性更好、噪声更少.

Abstract

At present,the deep neural networks generally use cross-entropy as the training loss function.When the loss score becomes small,the evaluation measure index may not become better.To solve the above defect,a U-Net network based on measure index loss was proposed.When the false acceptance rate and false rejection rate become smaller,the F-Measure increa-ses.Therefore,a measure index loss function named half total error rate was constructed.The divide-and-conquer strategy was adopted to divide the document image into many image patches with fixed size,each patch was binarized separately.Extensive experiments were conducted on eight document image competition datasets.Experimental results show that this method gets bet-ter results for the four metrics when compared with the original U-Net.Moreover,the binarized image has better text connectivity and less noise.

关键词

文档图像二值化/卷积神经网络/交叉熵/度量指标损失/打印图像/手写图像/深度学习

Key words

document image binarization/convolutional neural network/cross entropy/measure index loss/printed images/handwritten image/deep learning

引用本文复制引用

基金项目

国家自然科学基金项目(62201331)

国家自然科学基金项目(62101376)

国家自然科学基金项目(61901292)

山西省应用基础研究计划基金项目(201901D211078)

山西省应用基础研究计划基金项目(20210302124543)

出版年

2024

计算机工程与设计

中国航天科工集团二院706所

计算机工程与设计

CSTPCD北大核心

影响因子：0.617

ISSN：1000-7024

参考文献量1

段落导航