Research and Implementation of a Value-Added Tax Invoice Parsing and Recognition System

Value-added tax (VAT) invoices are the most commonly used payment vouchers in economic transactions, and processing them manually during financial reimbursement is inefficient. To address this, the paper designs and implements a VAT invoice parsing and recognition system. The system classifies invoice documents by type and handles each type differently: text-based electronic invoices are parsed directly with XML document processing tools, while scanned copies of paper invoices are recognized with PaddleOCR. Before a scanned paper invoice is recognized, its QR code is decoded to obtain the code's position and text content. The position information is used to correct the orientation of the invoice image, the corrected image is passed to the OCR module for text recognition, and the content decoded from the QR code is used to verify the OCR results, which improves the reliability of the recognition results.
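
The orientation correction and QR-based verification steps described in the abstract could be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the comma-separated payload layout, the premise that the QR code sits in the top-left corner of an upright invoice, and all names (`QrFields`, `parse_qr_payload`, `rotation_to_upright`, `verify_ocr`) are hypothetical and do not come from the paper.

```python
from dataclasses import dataclass
from decimal import Decimal, InvalidOperation


@dataclass
class QrFields:
    invoice_code: str
    invoice_number: str
    amount: str
    issue_date: str


def parse_qr_payload(payload: str) -> QrFields:
    """Split a comma-separated QR payload into named fields.

    Assumed layout (not taken from the paper):
    version, type, invoice code, invoice number, amount, date, ...
    """
    parts = [p.strip() for p in payload.split(",")]
    if len(parts) < 6:
        raise ValueError("unexpected QR payload: too few fields")
    return QrFields(parts[2], parts[3], parts[4], parts[5])


def rotation_to_upright(qr_center, image_size) -> int:
    """Return the clockwise rotation (degrees) that moves the QR code
    to the top-left quadrant, assuming that is where it sits on an
    upright invoice."""
    x, y = qr_center
    width, height = image_size
    left, top = x < width / 2, y < height / 2
    if left and top:
        return 0
    if not left and top:
        return 270  # scan is rotated 90 degrees clockwise
    if not left and not top:
        return 180  # scan is upside down
    return 90       # scan is rotated 90 degrees counterclockwise


def _digits(text: str) -> str:
    """Keep only digit characters, e.g. '2024年01月15日' -> '20240115'."""
    return "".join(ch for ch in text if ch.isdigit())


def _amount(text: str):
    """Parse a monetary string into Decimal; return None if unparsable."""
    cleaned = text.replace(",", "").replace("¥", "").replace("￥", "").strip()
    try:
        return Decimal(cleaned)
    except InvalidOperation:
        return None


def verify_ocr(ocr: dict, qr: QrFields) -> dict:
    """Compare OCR output with QR content; return {field: matched}."""
    ocr_amount = _amount(ocr.get("amount", ""))
    return {
        "invoice_code": _digits(ocr.get("invoice_code", "")) == qr.invoice_code,
        "invoice_number": _digits(ocr.get("invoice_number", "")) == qr.invoice_number,
        "amount": ocr_amount is not None and ocr_amount == _amount(qr.amount),
        "issue_date": _digits(ocr.get("issue_date", "")) == _digits(qr.issue_date),
    }
```

In such a design, any field whose check comes back `False` could be flagged for manual review instead of being accepted automatically, which is one way the QR content can raise confidence in the OCR output.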

Keywords: value-added tax invoice; invoice parsing; invoice recognition

汪香君、林冰清、陈玉强、刘会芬

College of Big Data and Internet, Shenzhen Technology University, Shenzhen, Guangdong 518118, China

2024

Modern Information Technology (现代信息科技)
Guangdong Institute of Electronics (广东省电子学会)

ISSN:2096-4706
Year, Volume (Issue): 2024, 8(10)