DRE-3DC: Document-Level Relation Extraction with Three-Dimensional Representation Combination Modeling

WANG Yu ¹, WANG Zhen ¹, WEN Liqiang ¹, LI Weiping ¹, ZHAO Wen ²


Author information

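1. School of Software and Microelectronics, Peking University, Beijing 100871, China
2. National Engineering Research Center for Software Engineering, Peking University, Beijing 100871, China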

Abstract

Document-level relation extraction aims to extract facts from multiple sentences of an unstructured document, and is a key step in building domain knowledge bases and knowledge-based question answering applications. Compared with sentence-level relation extraction, the task requires a model both to capture complex interactions between entities from the structural features of the document and to cope with a severe long-tailed distribution of relation categories. Existing table-based relation extraction models mainly model a document as a two-dimensional "entity/entity" table and use multi-layer convolutional networks or restricted (local) self-attention mechanisms to extract interaction features between entities. Because they do not explicitly decouple relation semantics, these models cannot avoid the influence of category overlap or capture the directional features of relations, and therefore lack sufficient semantic information about entity interactions. To address these challenges, this paper proposes a document-level relation extraction model named DRE-3DC (Document-Level Relation Extraction with Three-Dimensional Representation Combination Modeling), which extends two-dimensional table modeling to three-dimensional "entity/entity/relation" representation modeling. A triple attention mechanism based on deformable convolution distinguishes and aggregates the entity-entity and entity-relation interaction representations in different semantic spaces, and adaptively strengthens the model's aggregation of document structural features. In addition, a multi-task learning method enhances the model's perception of the combination of relation categories in the document as a whole, which alleviates the long-tailed distribution of relation categories. Experiments on two document-level relation extraction datasets, DocRED and Revisit-DocRED, show that DRE-3DC performs well, and ablation experiments, comparative analysis, and case analysis verify the effectiveness of the proposed method.
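To make the step from a two-dimensional "entity/entity" table to a three-dimensional "entity/entity/relation" table concrete, the following is a minimal sketch and not the authors' architecture. It assumes toy embedding vectors, hypothetical head and tail projection matrices (`w_head`, `w_tail`), and a simple trilinear score chosen purely for illustration; the deformable-convolution triple attention and the multi-task objective described in the abstract are not modeled here.

```python
import random


def build_3d_table(entities, relations, w_head, w_tail):
    """Return table[i][j][k], a score for (head entity i, tail entity j, relation k).

    Assumption for illustration only: a trilinear product of a projected head
    vector, a projected tail vector, and a relation vector.
    """
    def project(vec, weights):
        # Linear map: `weights` holds one row per output dimension.
        return [sum(w * x for w, x in zip(row, vec)) for row in weights]

    # Separate head/tail projections let table[i][j] differ from table[j][i],
    # so the table can carry relation direction.
    heads = [project(e, w_head) for e in entities]
    tails = [project(e, w_tail) for e in entities]
    return [
        [
            [sum(h * t * r for h, t, r in zip(head, tail, rel)) for rel in relations]
            for tail in tails
        ]
        for head in heads
    ]


def random_matrix(rows, cols, rng):
    """Toy stand-in for learned embeddings or weights."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
num_entities, num_relations, dim = 4, 3, 5
entities = random_matrix(num_entities, dim, rng)
relations = random_matrix(num_relations, dim, rng)
w_head = random_matrix(dim, dim, rng)
w_tail = random_matrix(dim, dim, rng)

table = build_3d_table(entities, relations, w_head, w_tail)
# Axes: head entity x tail entity x relation category.
print(len(table), len(table[0]), len(table[0][0]))
```

In this sketch each relation category gets its own slice of the table, which is one simple way to keep overlapping relation labels for the same entity pair in separate cells.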

Key words

document-level relation extraction/three-dimensional representation/triplet attention/deformable convolution/multi-task learning


Funding

National Key Research and Development Program of China (2020YFC0833300)

Publication year

2024

Acta Electronica Sinica

Chinese Institute of Electronics


Indexed in: CSTPCD, CSCD, Peking University Core Journals

Impact factor: 1.237

ISSN: 0372-2112

Number of references: 41
