基于联合学习的语言粒度融合的重叠事件抽取方法

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：事件抽取是一项重要的信息抽取任务,现有的事件抽取方法大多假设一个句子中仅出现一个事件,然而,在真实的场景下,重叠事件是难以避免的.文中提出了一种基于联合学习的语言粒度融合的重叠事件抽取方法.该方法设计了基于token数目逐层递增和逐层递减的策略,对不同语言粒度的片段进行表示,在此基础上,构建了渐进式语言粒度融合的句子表示.通过引入事件信息感知,建立了基于门控机制的语言粒度和事件信息融合的句子表示.最后,通过联合学习词间的片段关系和角色关系,实现对事件触发词、论元、事件类型和论元角色的判别.在FewFC和DuEE1.0-1数据集上进行了实验,所提LGFEE模型在事件类型判别任务上的F1值分别提高了0.8％和0.6％,在触发词识别、论元识别、论元角色分类任务上也获得了较高的召回率和F1值,验证了其有效性.

外文标题：Overlap Event Extraction Method with Language Granularity Fusion Based on Joint Learning

外文摘要：Event extraction is a crucial task in information extraction.The existing event extraction methods generally assume that only one event occurs in a sentence.However,overlapping events are inevitable in real scenarios.Therefore,this paper de-signs an overlap event extraction method with language granularity fusion based on joint learning.In this method,a strategy of in-creasing and decreasing token number layer by layer is designed to represent fragments of different language granularity.On this basis,a sentence representation of progressive language granularity fusion is constructed.By introducing event information per-ception,the sentence representation of language granularity and event information fusion based on gating mechanism is estab-lished.Finally,through the joint study of the fragment relationship and role relationship between words,the identification of event triggering words,arguments,event types and argument roles is realized.The experiments conducted on the FewFC and DuEE1.0-1 datasets demonstrate that the LGFEE model proposed in this paper achieves an improvement of 0.8％and 0.6％in the F1 score for event type discrimination tasks,respectively.Furthermore,it also exhibits higher recall rates and F1 scores in trigger word recognition,argument recognition,and argument role classification tasks,which verifies the validity of LGFEE model.

外文关键词：

Overlapping event extractionLanguage granularity fusionJoint learningAttention mechanismGating mechanism

作者：

闫婧涛、李旸、王素格、潘邦泽

展开 >

作者单位：

山西大学计算机与信息技术学院太原 030006

山西财经大学金融学院太原 030006

山西大学计算智能与中文信息处理教育部重点实验室太原 030006

关键词：

重叠事件抽取语言粒度融合联合学习注意力机制门控机制

基金：

国家重点研发计划国家自然科学基金山西省高等学校科技创新项目

项目编号：

2022QY0300-01621061302021L284

出版年：

2024

DOI：

10.11896/jsjkx.230700118

计算机科学

重庆西南信息有限公司（原科技部西南信息中心）

计算机科学

CSTPCD北大核心

影响因子：0.944

ISSN：1002-137X

年,卷(期)：2024.51(7)