首页|基于文本挖掘的ISO标准术语自动识别与标准术语知识图谱构建研究

基于文本挖掘的ISO标准术语自动识别与标准术语知识图谱构建研究

扫码查看
ISO标准术语蕴含特定的领域知识,是ISO标准文本数据的重要组成.在标准数字化转型下,ISO术语自动识别技术面临迫切的发展需求.本研究通过深入分析ISO标准术语的编写要求,总结了ISO标准术语核心要素的文本特性,基于此采用基于规则的文本挖掘方法构建了ISO标准术语自动识别模型及结构化和可视化加工路径,在ISO 26262标准上完成验证与应用,生成ISO 26262的标准术语知识图谱.本研究的技术路径能够为ISO标准实体抽取和相关标准数字化平台的构建提供一定的参考.
Research on Automatic Recognition of ISO Standard Terminology and Construction of Standard Terminology Knowledge Graph Based on Text Mining
ISO standard terminology contains specific domain knowledge and is an important component of ISO standard text data.In the context of the digital transformation of standards,ISO terminology automatic recognition technology is facing urgent development needs.This study conducted an in-depth analysis of the requirements for writing ISO standard terminology and summarized the text characteristics of the core elements of ISO standard terminology.Based on this,a rule-based text mining method was used to construct an automatic recognition model for ISO standard terminology and a structured and visualization processing path.The model was validated and applied on the ISO 26262 series of standards.The study can provide some reference for the extraction of ISO standard entities and the construction of related standard digital platforms.

ISOinternational standardterminology automatic recognitionstandard digitizationtext mining

方思怡

展开 >

上海市质量和标准化研究院

ISO 国际标准 术语自动识别 标准数字化 文本挖掘

2024

标准科学
中国标准化研究院 中国标准化协会

标准科学

影响因子:0.32
ISSN:1674-5698
年,卷(期):2024.(8)