海军工程大学学报2024,Vol.36Issue(1) :76-82,93.DOI:10.7495/j.issn.1009-3486.2024.01.012

基于RoBERTa-Span-Attack的标签指针网络军事命名实体识别

Military named entity recognition based on RoBERTa-Span-Attack tag pointer network

罗兵 张显峰 段立 陈琳
海军工程大学学报2024,Vol.36Issue(1) :76-82,93.DOI:10.7495/j.issn.1009-3486.2024.01.012

基于RoBERTa-Span-Attack的标签指针网络军事命名实体识别

Military named entity recognition based on RoBERTa-Span-Attack tag pointer network

罗兵 1张显峰 1段立 1陈琳1
扫码查看

作者信息

  • 1. 海军工程大学电子工程学院,武汉 430033
  • 折叠

摘要

军事领域文本中存在大量军事实体信息,准确识别这些信息是军事文本信息提取和构建军事知识图谱的基础性任务.首先,提出了一种基于RoBERTa预训练模型、跨度和对抗训练的标签指针网络的融合深度模型(RoBERTa-Span-Attack),用于中文军事命名实体识别;然后,采用了一种基于Span的标签指针网络,同时完成实体的起止位置和类别的识别任务;最后,在模型训练过程中加入对抗训练策略,通过添加一些扰动来生成对抗样本进行训练.在军事领域数据集上的实验结果表明:所提出的军事领域命名实体识别模型相较于BERT-CRF、BERT-Softmax和BERT-Span,在识别准确度上具有更优的效果.

Abstract

There are plenty of military entities in the documents of military field.Identification of such information is the basic task of extracting military text information and constructing military know-ledge graph.A model based on robustly optimized BERT pre-training approach(RoBERTa)Span and confrontation training label pointer network(RoBERTa-Span-Attack)was proposed,which was used for Chinese military named entity recognition.Because RoBERTa adopts the pre-training strategy of whole word mask,it has learned the semantic representation for the whole word,which is more sui-table for the recognition of Chinese military named entities.And then,a span-based label pointer net-work which can recognize the starting-end position and label of entities at the same time was adopted to improve the model performance.Finally,adversarial training strategy in which disturbances were added to generate adversarial samples for training process was employed to improve the robustness of the model.Experimental results on military domain dataset demonstrate that the proposed model has better recognition accuracy than BERT-CRF,BERT-Softmax and BERT-Span.

关键词

军事命名实体识别/预训练模型/跨度/标签指针网络/对抗训练

Key words

military named entity recognition/pre-trained model/span/label pointer network/ad-versarial training

引用本文复制引用

出版年

2024
海军工程大学学报
海军工程大学

海军工程大学学报

CSTPCD北大核心
影响因子:0.34
ISSN:1009-3486
参考文献量16
段落导航相关论文