首页|融合句法信息的实体关系联合抽取

融合句法信息的实体关系联合抽取

扫码查看
实体关系抽取是自然语言处理领域知识图谱构建的关键技术之一,有助于知识图谱自动化更新和扩充,并为下游任务提供重要的知识库支持。目前实体关系抽取方法大多从单一角度进行特征提取,导致特征表达能力不足,同时级联错误累积现象严重,无法较好针对实体关系重叠、实体嵌套现象进行适配,极大地影响实体关系抽取的精度和效率。为了同时解决这些问题,提出了一种融合语义和依存句法信息的实体关系联合抽取方法。该方法采用预训练语言模型BERT提取语义特征;然后利用句法注意力图卷积神经网络获取依存句法特征;最终,融合语义特征和依存句法特征对句子中多个关系的主客实体位置进行预测标记。实验结果表明,所提模型在NYT和WebNLG公共数据集上的F1值分别达到了92。8%和91。1%,与基线模型和其他深度学习模型相比,模型在重叠实体抽取上取得了较好的效果,验证了模型的有效性。
Syntactic Information Fused Joint Entity Relation Extraction
Entity relation extraction is one of the key task of knowledge graph construction in natural language processing.It helps to update and expand the knowledge graph automatically,and provides important knowledge base support for downstream tasks.At present,most entity relationship extraction methods extract features from a single perspective,resulting in insufficient feature expression ability.Meanwhile,the accumulation of cascading errors is severe,making it difficult to adapt well to the phenomenon of overlapping and nested entity relationships,greatly affecting the accuracy and efficiency of entity relationship extraction.To solve these problems at the same time,we propose a new joint entity relation extraction method that combines semantic and dependency syntactic information.First,pre-trained language model BERT is used to extract semantic features.Then,syntactic attention graph convolutional network is used to obtain syntactic features of fusion dependency information.Finally,dependency syntactic features and semantic features are combined to predict the position of subject and object entities in multiple relationships in a sentence.Experimental results show that the Fl value of the proposed model on NYT and WebNLG public data sets reaches 92.8%and 91.1%respectively.Compared with the baseline model and other deep learning models,the proposed model achieves better results in overlapping entity extraction,which verifies its effectiveness.

relation extractionsyntactic dependency analysisgraph convolution neural networkfeature fusionrelationship overlap

胡翼、于海、郭鑫、陈千、廖健、郑建兴、李艳红、杨可涵

展开 >

山西大学计算机与信息技术学院,山西太原 030006

中国移动通信集团山西有限公司,山西太原 030024

关系抽取 句法依存分析 图卷积神经网络 特征融合 关系重叠

国家自然科学基金山西省自然科学基金山西省自然科学基金山西省自然科学基金CCF智谱AI大模型基金

620761582022030212202120210302123468202203021221001CCF-Zhipu202310

2024

计算机技术与发展
陕西省计算机学会

计算机技术与发展

CSTPCD
影响因子:0.621
ISSN:1673-629X
年,卷(期):2024.34(8)