首页|面向公路工程规范的多粒度知识提取与知识应用方法

面向公路工程规范的多粒度知识提取与知识应用方法

扫码查看
针对公路工程领域知识繁多而应用效率低的问题,提出面向公路规范类文本的多粒度知识提取与知识应用方法.在词语粒度上构建了公路工程领域词库;在语段粒度上提出TEARS定义,将复杂语段转换为结构化的三元组结构;在子句粒度上总结了四种主要句法,并各自设计了语义信息的抽取方法.以 967 本公路规范类文本为数据源,从中提取知识并构建了公路工程领域知识图谱,通过与深度学习方法比较验证了正确性,开发公路工程安全信息检索与应用系统.结果表明:该方法实现了非结构化公路规范类文本的知识提取,构建的领域知识图谱质量较高,满足工程应用需求.
Multi-Level Knowledge Extraction and Application Methods for Highway engineering Specifications
Aiming at the problem of low efficiency in searching huge domain knowledge in the field of highway engineering,a multi-level knowledge extraction method for highway engineering specifications is proposed in the present paper.In the word level,a domain lexical database of highway engineering is constructed.In the paragraph level,a TEARS definition for highway engineering specifications is proposed,therefore unstructured paragraphs can be converted into structured triples.In the sentence level,four main sentence structures and their extraction methods for semantic information are designed respectively.The research constructs a domain knowledge graph of highway engineering by using above established methods and taking 967 highway engineering specifications as data source,and further develops a highway engineering safety information searching and application system.The result shows that the proposed methods can successfully extract knowledge from highway engineering specifications,and the constructed domain knowledge graph can fully meet the needs of engineering applications.

Highway EngineeringInformation ExtractionNatural Language ProcessChinese SpecificationKnowledge Graph

孙克强、张嘉鸿、伍震、胡振中

展开 >

广东省路桥建设发展有限公司,广州 510623

清华大学 土木工程系,北京 100084

清华大学 深圳国际研究生院,深圳 518055

公路工程 信息抽取 自然语言处理 中文规范 知识图谱

国家重点研发计划

2022YFC3801100

2024

土木建筑工程信息技术
中国图学学会

土木建筑工程信息技术

影响因子:0.933
ISSN:1674-7461
年,卷(期):2024.16(1)
  • 21