宜宾学院学报2024,Vol.24Issue(6) :27-32,91.DOI:10.19504/j.cnki.issn1671-5365.2024.06.05

基于卷积神经网络的酒文献关键词自动识别与标注算法

An Automatic Keyword Recognition and Annotation Algorithm for Liquor Literature Based on Convolu-tional Neural Networks

张桃 童旭 胡隆河 杨强
宜宾学院学报2024,Vol.24Issue(6) :27-32,91.DOI:10.19504/j.cnki.issn1671-5365.2024.06.05

基于卷积神经网络的酒文献关键词自动识别与标注算法

An Automatic Keyword Recognition and Annotation Algorithm for Liquor Literature Based on Convolu-tional Neural Networks

张桃 1童旭 2胡隆河 1杨强1
扫码查看

作者信息

  • 1. 宜宾学院人工智能与大数据学部,四川宜宾 644000
  • 2. 成都理工大学信息工程学院,四川成都 610059
  • 折叠

摘要

关键词自动识别与标注算法在酒类历史文献自动分析和机器识别理解领域中有重要价值.首先采用YOLOv7网络模型进行酒文献的文本框识别,接着引入CBAM注意力机制获得文本框位置、大小等特征,然后采用PaddleOCR算法实现酒文献的关键词识别,最后应用文字修补技术进行优化处理.应用该检测算法设计的实验分析系统能高效处理海量酒文献数据,以90%的识别率提取文献中与酒类相关的文字信息,能有效克服酒文献中存在的文字印刷模糊不完整、字体种类多样的特殊情形,实验中取得了较好的识别标注效果.

Abstract

Keyword automatic recognition and annotation algorithms have important value in automatic analysis and machine recognition understanding of alcoholic history literature.Firstly,the YOLOv7 network model is used for text box recognition in liquor literature.Secondly the CBAM attention mechanism is introduced to obtain features such as text box position and size.Thirdly,the PaddleOCR algorithm is used to achieve keyword recognition in liquor literature.Finally,text repair technology is applied for optimization processing.The experimental analysis system designed using this detection algorithm can efficiently pro-cess massive alcohol literature data,extract text information related to alcohol with a 90%recognition rate,and effectively over-come the special situation of blurred and incomplete text printing and diverse font types in alcohol literature,achieving good rec-ognition and annotation results in the experiment.

关键词

深度学习/卷积神经网络/文字识别/酒文献

Key words

deep learning/convolutional neural networks/text recognition/wine literature

引用本文复制引用

基金项目

四川省哲学社会科学重点研究基地中国酒史研究中心开放基金项目(ZGJS2021-03)

四川省科技计划重点研发项目(2021YFG0029)

出版年

2024
宜宾学院学报
宜宾学院

宜宾学院学报

CHSSCD
影响因子:0.185
ISSN:1671-5365
参考文献量4
段落导航相关论文