
Improved Global Pointer Based Named Entity Recognition Method for Enterprise-benefiting Policies

Micro, small, and medium-sized enterprises play an important role in the national economy. In recent years, the state has issued a range of enterprise-benefiting policies that contain key information about government decision-making. However, policy texts are structurally complex, depend heavily on domain-specific semantics, and contain noisy text and nested entities, which makes information extraction difficult. To address this, a named entity recognition model based on a multi-level lexical global pointer and adversarial training is proposed. In the embedding layer, the model incorporates LEBERT to obtain combined semantic representations of characters and words. It then uses a global pointer to build a global entity matrix, so that flat and nested entities are handled in a unified way. Rotary position encoding is introduced to improve the model's sensitivity to position information, and adversarial training is added to improve stability and robustness. Experimental results show that the model achieves an F1 score of 81.90%, which is 4.72% higher than a classical sequence-labeling-based model, and that its overall performance is sufficient to support downstream tasks.
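The abstract names a global pointer with rotary position encoding but gives no formulas. The sketch below illustrates the general idea only, assuming the commonly described global-pointer formulation: each candidate span (start, end) is scored by the dot product of a rotated start vector and a rotated end vector, and every span above a threshold is kept, which is how nested entities can coexist. It is not the authors' implementation. The function names `rotary`, `span_scores`, and `decode`, the threshold of 0, and the toy numbers are all hypothetical, and the query and key vectors are taken as given inputs rather than produced by LEBERT.

```python
import math

def rotary(vec, pos, base=10000.0):
    """Apply rotary position encoding to an even-length vector at position pos.

    Each consecutive pair (vec[k], vec[k+1]) is rotated by the angle
    pos * base**(-k / d), where d is the vector length.
    """
    d = len(vec)
    out = []
    for k in range(0, d, 2):
        theta = pos * base ** (-k / d)
        c, s = math.cos(theta), math.sin(theta)
        x, y = vec[k], vec[k + 1]
        out.extend([x * c - y * s, x * s + y * c])
    return out

def span_scores(queries, keys):
    """Build the span score matrix for ONE entity type (assumed setup).

    Entry [i][j] is the dot product of the rotated start vector at token i
    and the rotated end vector at token j. Spans with i > j are invalid
    and are masked to -inf.
    """
    n = len(queries)
    q = [rotary(v, i) for i, v in enumerate(queries)]
    k = [rotary(v, j) for j, v in enumerate(keys)]
    return [
        [sum(a * b for a, b in zip(q[i], k[j])) if i <= j else float("-inf")
         for j in range(n)]
        for i in range(n)
    ]

def decode(scores, threshold=0.0):
    """Return every (start, end) span whose score exceeds the threshold."""
    return [(i, j)
            for i, row in enumerate(scores)
            for j, s in enumerate(row)
            if s > threshold]

# Toy score matrix (hand-written, not model output): the span (0, 3)
# contains the nested span (1, 2), and both are recovered in one pass.
NEG = float("-inf")
toy = [
    [-1.0, -1.0, -1.0, 2.0],
    [NEG, -1.0, 1.5, -1.0],
    [NEG, NEG, -1.0, -1.0],
    [NEG, NEG, NEG, -1.0],
]
print(decode(toy))  # [(0, 3), (1, 2)]
```

Because both vectors are rotated by position-dependent angles, the score for a pair of vectors depends only on the offset between the start and end positions, not on their absolute positions. Scoring spans directly, instead of assigning one tag per token, is what lets overlapping spans be reported side by side.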

Keywords: named entity recognition; enterprise-benefiting policies; pre-training model; global pointer; adversarial training

Yang Qianyi (杨虔懿), Yu Jinping (喻金平)


School of Information Engineering, Jiangxi University of Science and Technology, Ganzhou, Jiangxi 341000


2024

Software Guide (软件导刊)
Hubei Information Society (湖北省信息学会)


Impact factor: 0.524
ISSN: 1672-7800
Year, volume (issue): 2024, 23(12)