基于对抗训练的事件要素识别方法

Event element recognition method based on adversarial training

廖涛 ¹沈文龙 ¹张顺香 ¹马文祥¹

扫码查看

作者信息

1. 安徽理工大学计算机科学与工程学院,安徽淮南 232001
折叠

摘要

针对目前大多数事件要素识别模型未考虑词级别的语义信息,及模型鲁棒性不高的问题,提出一种融合词信息和对抗训练的事件要素识别方法.将Bert(bidirectional encode representations from transformers)预训练语言模型生成的字向量与分词信息进行融合,在得到的融合向量中添加扰动因子产生对抗样本,将对抗样本与融合向量表示作为编码层的输入;采用BiGRU(bidirectional gating recurrent unit)网络对输入的文本进行编码,丰富文本的上下文语义信息;采用CRF(conditional random field)函数计算完成事件要素的识别任务.在CEC(Chinese emergency corpus)中文突发事件语料库上的实验结果表明,该方法能够取得较好的效果.

Abstract

To solve the problems that most of the current event element recognition models do not consider the semantic informa-tion of the word level,and the robustness of the models is not high,an event element recognition method that combined word information and adversarial training was presented.The word vectors generated using the Bert(bidirectional encode representa-tions from transformers)pre-trained language model were fused with word segmentation information.Disturbance factors were added to the obtained fusion vectors to generate adversarial samples.The adversarial samples and fusion vectors were represented as inputs to the encoding layer.BiGRU(bidirectional gating recurrent unit)network was used to encode the input text to enrich the context semantic information of the text.The CRF(conditional random field)function was used to calculate and complete the task of identifying event elements.Experimental results on the CEC(Chinese emergency corpus)show that the method can achieve acceptable results.

关键词

事件要素识别/鲁棒性/词信息/对抗训练/预训练语言模型/扰动因子/上下文语义信息

Key words

event element recognition/robustness/word information/adversarial training/pre-trained language model/distur-bance factor/contextual semantic information

引用本文复制引用

基金项目

国家自然科学基金面上基金项目(62076006)

安徽省属高校协同创新基金项目(GXXT-2021-008)

安徽省自然科学基金面上基金项目(1908085MF189)

出版年

2024

计算机工程与设计

中国航天科工集团二院706所

计算机工程与设计

CSTPCD北大核心

影响因子：0.617

ISSN：1000-7024

参考文献量16

段落导航