基于软提示微调和强化学习的网络安全命名实体识别方法研究

扫码查看

原文链接

万方数据
维普

中文摘要：随着网络技术的迅猛发展,新型网络安全威胁不断涌现,网络安全命名实体识别重要性日益增加.针对现有基于大语言模型的命名实体识别方法在网络安全领域识别准确率差的问题,提出了一种结合软提示微调和强化学习的网络安全命名实体识别方法.通过结合软提示微调技术,针对网络安全领域的复杂性,精细调整大语言模型的识别能力,提升模型对网络安全命名实体的识别准确率,同时优化训练效率.此外,提出了基于强化学习的网络安全实体筛选器,可以有效去除训练集中的低质量标注,从而提升识别准确率.在2个开源基准网络安全实体识别数据集上评估了所提方法,实验结果表明,所提方法的F1值优于现有最佳的网络安全命名实体识别方法.

外文标题：Research on named entity recognition method in cybersecurity based on soft prompt tuning and reinforcement learning

外文摘要：As network technology rapidly advanced,new cybersecurity threats constantly emerged,increasing the impor-tance of cybersecurity named entity recognition.To address the problem of poor recognition accuracy in named entity recognition methods based on large language models in the cybersecurity domain,a novel cybersecurity named entity recognition method that combined soft prompt tuning and reinforcement learning was proposed.By integrating the soft prompt tuning technique,the method precisely adjusted the recognition capabilities of large language models to handle the complexity of the cybersecurity domain,improving recognition accuracy for cybersecurity named entities while opti-mizing training efficiency.Additionally,a reinforcement learning-based instance filter was proposed,which effectively removed low-quality annotations from the training set,further enhancing recognition accuracy.The proposed method was evaluated on two benchmark cybersecurity NER datasets,with experimental results demonstrating superior perfor-mance in F1 score compared to state-of-the-art cybersecurity NER methods.

外文关键词：

cybersecurity named entity recognitionsoft prompt tuningreinforcement learninglarge-scale pre-trained models

作者：

田泽庶、刘春雨、张云婷、张嘉宇、孟超、张宏莉

展开 >

作者单位：

哈尔滨工业大学计算学部,黑龙江哈尔滨 150001

关键词：

网络安全命名实体识别软提示微调强化学习大规模预训练模型

基金：

国家重点研发计划基金资助项目国家重点研发计划基金资助项目黑龙江省自然科学基金资助项目

项目编号：

2016QY03D05012017YFB0803304LH2023F018

出版年：

2024

DOI：

10.11959/j.issn.1000-436x.2024183

通信学报

中国通信学会

通信学报

CSTPCD北大核心

影响因子：1.265

ISSN：1000-436X

年,卷(期)：2024.45(10)