软件导刊2024,Vol.23Issue(12) :82-91.DOI:10.11907/rjdk.232212

基于改进全局指针的惠企政策命名实体识别方法

Improved Global Pointer Based Named Entity Recognition Method for Enterprise-benefiting Policies

杨虔懿 喻金平
软件导刊2024,Vol.23Issue(12) :82-91.DOI:10.11907/rjdk.232212

基于改进全局指针的惠企政策命名实体识别方法

Improved Global Pointer Based Named Entity Recognition Method for Enterprise-benefiting Policies

杨虔懿 1喻金平1
扫码查看

作者信息

  • 1. 江西理工大学 信息工程学院,江西 赣州 341000
  • 折叠

摘要

中小微企业在国民经济中具有重要地位.近年来,国家推出的多种惠企政策包含政府决策关键信息.然而,政策文本结构复杂、专业语义性依赖较强,且含有噪声文本与嵌套实体,信息提取难度高.为此,提出一种基于多级词汇全局指针与对抗训练的命名实体识别模型.该模型在嵌入层融合LEBERT模型获取字符与词汇的组合语义表示,通过全局指针构建全局实体矩阵,统一处理扁平和嵌套实体;同时引入旋转式位置编码提升对位置信息的感知力,并结合对抗训练增强稳定性和鲁棒性.实验结果表明,该模型的F1值为81.90%,与经典的基于序列标注的模型相比提升了4.72%,整体性能支持下游任务开展.

Abstract

Small and medium-sized enterprises play an important role in the national economy.In recent years,various preferential policies for enterprises introduced by the government have included key information for government decision-making.However,policy texts have com-plex structures,strong dependence on professional semantics,and contain noisy text and nested entities,making information extraction diffi-cult.Therefore,a named entity recognition model based on multi-level vocabulary global pointers and adversarial training is proposed.This model integrates the LEBERT model at the embedding layer to obtain the combined semantic representation of characters and vocabulary,and constructs a global entity matrix through global pointers to uniformly process flat and nested entities;Simultaneously introducing rotary posi-tion encoding to enhance the perception of position information,and combining it with adversarial training to enhance stability and robustness.The experimental results show that the F1 value of the model is 81.90%,which is 4.72%higher than the classical sequence annotation based model.The overall performance supports downstream task development.

关键词

命名实体识别/惠企政策/预训练模型/全局指针/对抗训练

Key words

named entity recognition/enterprise-benefiting policies/pre-training model/global pointer/adversarial training

引用本文复制引用

出版年

2024
软件导刊
湖北省信息学会

软件导刊

影响因子:0.524
ISSN:1672-7800
段落导航相关论文