首页|基于注意力和图网络的多粒度脚本事件推理方法

基于注意力和图网络的多粒度脚本事件推理方法

扫码查看
[目的]对于一些覆盖面广、延续时间长的事件(如台风、洪水等自然灾害),考虑事件演化过程复杂多变,所获取的事件信息存在疏密程度不均的问题,研究其事件推理问题.[方法]提出一种基于注意力机制和图网络的多粒度脚本事件推理方法,通过提取及合并不同粒度的事件信息以获得更丰富的事件语义信息,并结合图注意力网络和事件转移概率矩阵以充分利用密集事件连接提升推理性能.具体而言,首先采用自注意力机制从事件链中获得事件片段,并结合单个事件和事件片段的表示扩展模型输入信息,然后充分利用密集的事件连接,从海量事件新闻中提取事件链构建事件演化图来获得事件转移矩阵,提高模型推理准确率.[结果]在多个真实台风事件以及标准数据集上的实验结果表明,本文所提方法优于传统的基于事件链、基于事件对以及基于事件图等的事件推理模型.[结论]采用自注意力机制从事件链中获得事件片段,并结合单个事件和事件片段的表示扩展模型输入信息可以缓解事件稀疏带来的推理困难问题;仅采用时序关系不能完全表示基本事件链与候选事件的关系,构建事件演化图可以更好的提取事件间的复杂交互信息;多粒度事件信息的融合可以更好地反映候选事件与基本事件链的关系.
A multi-granularity script event prediction method based on attention and graph networks
[Objective]When given an existing event context,script event prediction(SEP)aims to predict the subsequent counterpart.However,some disaster events,such as typhoons and floods,due to the complex event evolution process,their event contexts endure different degrees of information density,prompting the event reasoning very challenging.In this paper,we propose a multi-granularity SEP method based on attention mechanism and graph network to solve two problems in complex event reasoning,namely,how to merge different levels of event information to obtain deeper semantic information,and how to fully utilize dense event connections to improve the reasoning ability of the model.[Methods]First,we utilize the dependent syntax and semantic role labeling capabilities of the Language Technology Platform to extract event triples from News texts and construct event chains.Next,we employ the self-attention mechanism to obtain event fragments and merge them with the representations of individual events as inputs to the model.In addition,event chains are extracted from large-scale News texts to construct event evolution graphs and obtain transfer probability matrices between events.Based on these event representations and probability matrices,we construct a three-layer graph attention network(GAT)for inference.[Results]The performance of our proposed method is compared with other eight baseline methods on the benchmark dataset New York Times(NYT).To verify the effectiveness of the proposed method on the disaster events,we also construct a real dataset,which contains 20 typhoons and more than 2 800 related News reports.Overall experimental results on the typhoon dataset and benchmark dataset show that the accuracy of the proposed method reaches as high as 88.2%on the typhoon dataset and 54.23%on the NYT dataset,more satisfactory than other statistical and neural network-based methods.Specifically,on typhoon dataset EventComp,PairLSTM,and LSTM models,in which only strong temporal relationships in the event chain are considered,accuracies of 79.32%,78.45%,and 73.46%are obtained,respectively.On NYT,accuracy of 49.57%,50.83%,and 45.53%are obtained,respectively.The graph-based model SGNN performs more satisfactorily on both the typhoon dataset(87.27%)and the NYT dataset(52.45%)than those event-pair-based models do.Moreover,to investigate the intrinsic mechanisms of how the model improves event prediction performance,we conduct a series of ablation experiments on the typhoon dataset.Experimental results show that removing the event fragment representation leads to a significant performance degradation of 2.6 percentage points,thus highlighting the importance of event fragment representation in the model.Similarly,removing the attention mechanism in the event inference layer triggers a 0.4 percentage point performance degradation,likewise demonstrating the critical role of the attention mechanism in extracting valid information.[Conclusions]The proposed multi-granularity event inference method based on the attention mechanism and graph network attains significant improvements compared with those existing methods.Extracting and combining event information at different granularities,we can obtain richer event semantic information.Also,combining graph attention networks and event transition probability matrices,we can fully utilize dense event connections to enhance inference performance.The experimental analysis shows that the event fragment representation and graph-based learning method constitute the two major factors for enhancing prediction performance.The former shows that the event chain contains key correlation information among events,whereas the latter shows the event evolution graph can extract more effective interaction information among events and the event inference task can perform more satisfactorily.

event predictiongraph attention networkattentionevent evolution graph

倪进鑫、蒋晨辉、周绮凤

展开 >

厦门大学航空航天学院,福建厦门 361102

厦门市大数据智能分析与决策重点实验室,福建厦门 361102

事件推理 图注意力网络 注意力 事件演化图

国家自然科学基金

62171391

2024

厦门大学学报(自然科学版)
厦门大学

厦门大学学报(自然科学版)

CSTPCD北大核心
影响因子:0.449
ISSN:0438-0479
年,卷(期):2024.63(2)
  • 18