Computer Technology and Development (计算机技术与发展), 2023, Vol. 33, Issue 12: 156-162. DOI: 10.3969/j.issn.1673-629X.2023.12.022

Stance Detection Based on Multi-mask and Prompt Sentence Vector Fusion Classification

Wang Zhengjia (王正佳) 1, Li Fei (李霏) 1, Ji Donghong (姬东鸿) 1, Teng Chong (滕冲) 1

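Author information

  • 1. Key Laboratory of Aerospace Information Security and Trusted Computing, Ministry of Education, School of Cyber Science and Engineering, Wuhan University, Wuhan, Hubei 430072, China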

Abstract

Stance detection analyzes the stance that a text expresses toward a target topic; the stance is usually classified as support, against, or other. Most recent work uses methods such as BERT to extract sentence-level semantic features of the text and the topic, typically taking either the hidden state of BERT's first token or the average of the hidden states of all words in the sentence as the sentence vector. This paper improves how sentence vectors are obtained: it uses prompt learning templates to produce prompt sentence vectors, which improves the feature extraction of the sentence vector. A stance detection model based on multi-mask and prompt sentence vector fusion classification (PBMSV) is designed. It combines prompt sentence vector classification with multi-mask template-verbalizer prompt learning classification, introduces text, topic, and stance word information into the sentence vector, fuses the sentence vector and verbalizer classification results, and optimizes the model jointly. Experiments on the NLPCC Chinese stance detection dataset show that, when a separate model is trained for each of the five topics, the proposed method leads or ties the previous best method on three targets and reaches an overall F1 of 79.3, close to that of the best method. A sentence vector comparison experiment verifies the advantage of prompt sentence vectors.
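The two baseline sentence vectors named in the abstract (first-token hidden state and mean of token hidden states) can be illustrated with a small sketch. The third function is only an assumption about what a prompt sentence vector might look like: it averages the hidden states at the mask positions of a template, which is one common way to read a sentence vector from a prompt. The abstract does not specify the paper's actual template, pooling, or fusion, so the names `mask_position_vector` and `mask_positions` and the random stand-in for encoder output are hypothetical.

```python
import numpy as np

def cls_vector(hidden):
    """First-token pooling: the hidden state of the first token is the sentence vector."""
    return hidden[0]

def mean_vector(hidden, attention_mask):
    """Mean pooling: average hidden states over real (non-padding) tokens."""
    mask = attention_mask.astype(hidden.dtype)[:, None]
    return (hidden * mask).sum(axis=0) / mask.sum()

def mask_position_vector(hidden, mask_positions):
    """Assumed prompt-style pooling: average the hidden states at template mask positions."""
    return hidden[mask_positions].mean(axis=0)

# Toy encoder output: 6 tokens, hidden size 4 (random stand-in for BERT hidden states).
rng = np.random.default_rng(0)
hidden = rng.normal(size=(6, 4))
attention_mask = np.array([1, 1, 1, 1, 1, 0])  # the last token is padding
mask_positions = [3, 4]  # hypothetical positions of two mask tokens in the template

v_cls = cls_vector(hidden)
v_mean = mean_vector(hidden, attention_mask)
v_prompt = mask_position_vector(hidden, mask_positions)
```

All three functions return a vector of the encoder's hidden size, so any of them could feed the same downstream classifier; they differ only in which token positions contribute to the sentence vector.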

Key words

stance detection/deep learning/prompt learning/sentence vector/multi-mask

Funding

National Natural Science Foundation of China (62176187)

Publication year

2023
Computer Technology and Development
Shaanxi Computer Society

CSTPCD
Impact factor: 0.621
ISSN: 1673-629X
Number of references: 7