首页|结合领域知识的标签生成方法研究

结合领域知识的标签生成方法研究

扫码查看
传统文本资源的标签生成算法忽略了与领域有关的语义属性,不适用于针对特定领域的标签生成任务。论文提出了一种适应于军事领域特征的标签生成算法,首先使用适合该领域的分词方法,进而基于文本资源的主题信息和词语的统计特征进行标签的自动生成。实验结果显示,所提方法在准确率、召回率及F值上较传统的TF-IDF算法有一定的提升。
Research on Tag Generation Method Combining Domain Knowledge
The traditional tag generation algorithm for text resources can not be well applied to the task of tag generation for specific fields because it ignores the semantic features related to fields.In this paper,a tag generation algorithm suitable for the char-acteristics of the military field is proposed.First,the paper uses the word segmentation method suitable for this field,and then auto-matically generates the tags based on the topic information of the text resources and the statistical characteristics of the words.The experimental results show that the proposed method has a certain improvement in accuracy,recall rate and F value compared with the traditional TF-IDF algorithm.

keyword extractiontag generationword segmentationLDA topic modelstatistical characteristics

景道月

展开 >

镇江市食品药品监督检验中心 镇江 212004

关键词抽取 标签生成 分词 LDA主题模型 统计特征

2024

计算机与数字工程
中国船舶重工集团公司第七0九研究所

计算机与数字工程

CSTPCD
影响因子:0.355
ISSN:1672-9722
年,卷(期):2024.52(5)
  • 18