首页|基于异构图和语义融合的实体关系抽取

基于异构图和语义融合的实体关系抽取

扫码查看
关系抽取是信息抽取中的一项重要任务,其目的是从非结构化文本中抽取出所有关系三元组。然而,如何有效地处理这一问题仍然是一个挑战,特别是对于关系重叠问题。为了有效处理重叠问题,该文提出一种基于异构图和语义融合的实体关系抽取方法:使用异构图将关系信息作为先验知识融入词表示,增强词表示的表示能力,使得模型能有效地处理单词实体重叠问题;使用语义融合模块将不同层次特征融合在一起作为关系分类模型的输入,使得模型能够有效地处理实体对重叠问题。所提方法在NYT和WebNLG数据集上取得了最好的效果,详细的实验也表明所提方法可以处理复杂的场景。
Entity-relation extraction based on heterogeneous graphs and semantic fusion
[Objective]Relational extraction,which involves the extraction of all relational triples from unstructured text,is an important task in natural language processing.However,effectively addressing the problem of overlapping entity relations remains a challenge.Entity-relation overlap is a significant challenge in entity-relation extraction within natural language processing.Entity-relation overlap refers to the phenomenon in which an entity may have relationships with more than one entity or where multiple relationships exist between pairs of entities.[Methods]To better address the issue of relation overlap,this study proposes an entity-relation extraction method based on heterogeneous graphs and semantic fusion.The overall strategy is to first extract entities and then classify different pairs of entities into specific relationships.This approach effectively addresses the problem of single-entity overlap.To maximize entity extraction,heterogeneous layers are used to integrate predefined relationships as relational prior information into word representation.This enhances representation capability,making it more conducive to entity annotation tasks and reducing the extraction of redundant entities.After the entities are obtained,a global association matrix is employed to filter out entity pairs that do not have relational connections,thereby ensuring that only the correct entity pairs are selected.To better classify the relationship types between entity pairs with relational connections,a semantic fusion module is used to aggregate features at different levels as the input for the relational classification module.This can improve the performance of relational classification and address the problem of entity-pair overlap.[Results]Experimental results demonstrate that the proposed models outperform other benchmark models on the NYT and WebNLG datasets.Specifically,for the NYT data set,the proposed method improves the Fl value by 0.3%compared with the best existing method,and for the WebNLG data set,it improves the Fl value by 0.7%compared with the best model.Compared with RIFRE,the proposed model uses a semantic fusion module to aggregate multigranularity information in the subsequent decoding process,resulting in better quantitative performance.To further explore the effectiveness of the proposed model in handling overlapping entity-relation triples,two extended experiments for different sentence types are designed and performed.[Conclusions]The results of these extended experiments show that,for the WebNLG dataset,the proposed model outperforms other models in terms of processing different types of sentences and handling complex scenarios.For the NYT data set,the proposed model outperforms the benchmark model in extraction and the handling of complex scenarios.Even for nonoverlapping sentences,the proposed model achieves superior results.This indicates that the proposed method can effectively address complex scenarios and various types of overlapping problems.Experiments show that the entity-relation extraction method based on heterogeneous graphs and semantic fusion can effectively manage overlapping issues and extract entity-relation triples.Detailed experiments also confirm that the proposed method can handle complex scenarios.

entity-relation extractionheterogeneous graphsemantic fusionrelationship overlapentity relationship triples

唐贤伦、丁河长、唐瑜泽、谢涛、罗洪平

展开 >

重庆邮电大学自动化学院,重庆 400065

实体关系抽取 异构图 语义融合 关系重叠 实体关系三元组

重庆市研究生教育教学改革研究重大项目重庆市自然科学基金面上项目

yjg241008CSTB2022NSCQ-MSX0380

2024

实验技术与管理
清华大学

实验技术与管理

CSTPCD北大核心
影响因子:1.651
ISSN:1002-4956
年,卷(期):2024.41(8)