Passport Text Information Recognition Based on Deep Learning

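Applying deep-learning-based scene text detection and scene text recognition algorithms to produce structured output of the key text information in passport images from multiple countries is of considerable value. Because detection algorithms struggle with text of extreme aspect ratio and small scale, this paper uses a pixel-segmentation-based detection method and performs multi-scale feature fusion to obtain feature maps at different scales. Where character pixels are corrupted, a recurrent neural network models the contextual relationships among image features to lessen the effect of stains and damage. Where irrelevant text interferes, a multimodal Transformer based on text and layout information learns the multimodal patterns of the key information, filters out irrelevant content, and supports keyword matching and extraction. The approach achieved good experimental results.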
A Passport Text Information Recognition System Based on Deep Learning
Applying existing deep-learning-based scene text detection and recognition algorithms to passports, so as to produce structured output of the key information in passport images from multiple countries, is of practical value. To detect text with extreme aspect ratios and relatively small size, this paper uses a pixel-segmentation-based algorithm with multi-scale feature fusion. To alleviate interference in character pixels, it uses a recurrent neural network to model the context of the image features, which reduces the effect of defacement. To avoid interference from irrelevant text, it uses a multimodal Transformer based on text and layout information to capture the multimodal patterns of the key information, filter out irrelevant content, and match and extract keywords. The experiments show that the system achieves good results.
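The last stage described above, matching keywords and extracting the associated values into structured output, can be illustrated with a far simpler stand-in than the multimodal Transformer the paper uses. The following is a minimal rule-based sketch, not the author's method. It assumes the detection and recognition stages have already produced text lines with bounding boxes, and that each value is printed directly beneath its label. The names `TextBox`, `FIELD_LABELS`, and `extract_fields`, as well as the 0.8 similarity threshold, are hypothetical choices made for illustration.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import Dict, List, Optional


@dataclass
class TextBox:
    """One recognized text line and its bounding box (hypothetical structure)."""
    text: str
    x: float  # left edge
    y: float  # top edge
    w: float  # width
    h: float  # height


# Hypothetical field labels; real passports differ by country and language.
FIELD_LABELS = {
    "surname": "Surname",
    "passport_no": "Passport No",
    "nationality": "Nationality",
}


def similarity(a: str, b: str) -> float:
    """Case-insensitive fuzzy ratio, so small OCR errors still match."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def find_label(boxes: List[TextBox], label: str,
               threshold: float = 0.8) -> Optional[TextBox]:
    """Return the box whose text best matches the label, if it is close enough."""
    best = max(boxes, key=lambda b: similarity(b.text, label), default=None)
    if best is not None and similarity(best.text, label) >= threshold:
        return best
    return None


def find_value(boxes: List[TextBox], label_box: TextBox,
               label_boxes: List[TextBox]) -> Optional[TextBox]:
    """Pick the nearest non-label box below the label that overlaps it horizontally."""
    candidates = [
        b for b in boxes
        if b not in label_boxes
        and b.y > label_box.y
        and b.x < label_box.x + label_box.w
        and label_box.x < b.x + b.w
    ]
    return min(candidates, key=lambda b: b.y - label_box.y, default=None)


def extract_fields(boxes: List[TextBox]) -> Dict[str, str]:
    """Map each known field to the text found beneath its label."""
    labels = {key: find_label(boxes, text) for key, text in FIELD_LABELS.items()}
    label_boxes = [b for b in labels.values() if b is not None]
    result: Dict[str, str] = {}
    for key, label_box in labels.items():
        if label_box is None:
            continue  # label not found on this page
        value = find_value(boxes, label_box, label_boxes)
        if value is not None:
            result[key] = value.text
    return result
```

Text that matches no label and sits under no label, such as a page heading, is simply ignored, which is a crude analogue of filtering out irrelevant text. A learned model that combines text and layout, as the abstract describes, would replace the fixed label list and the "value sits below label" rule.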

passport; deep learning; text detection; text recognition; key information extraction

Xie Zijing (谢子敬)

School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China

passport; deep learning; text detection; text recognition; key information extraction

2024

Digital Communication World (数字通信世界)
Publishing House of Electronics Industry (电子工业出版社)

Impact factor: 0.162
ISSN: 1672-7274
Year, volume (issue): 2024.(10)