首页|结合注意力机制和Mengzi模型的短文本分类

结合注意力机制和Mengzi模型的短文本分类

扫码查看
如何使用短文本分类技术挖掘有用的文本信息,是当前热门的研究方向之一.为了解决短文本特征信息稀疏和特征信息难以提取的问题,提出一种Mengzi-ADCBU短文本分类模型,该模型利用Mengzi预训练模型将输入的文本信息转化为相应的文本表示,再将获得的文本向量分别输入改进的深度金字塔卷积神经网络和融合了多头注意力机制的双向门控单元中提取文本特征信息,将两者提取到的特征信息进行融合之后,输送给全连接层和Softmax函数完成短文本分类.在公开的短文本数据集THUCNews和SougouCS上分别进行多组模型对比实验,实验结果表明本文提出的Mengzi-ADCBU模型在短文本分类的准确率、精确度、召回率和F1值等评价指标上都比现在的主流模型性能更优,具有较好的短文本分类能力.
Short Text Classification Combining Attention Mechanism and Mengzi Model
How to use short text classification technology to mine useful text information is one of the current hot research direc-tions.To solve the problem of sparse feature information and difficult extraction of short text,a short text classification model named Mengzi-ADCBU is proposed.This model uses Mengzi pre-training model to convert input text information into correspond-ing text representation.Then,the obtained text vectors are input to the improved deep pyramid convolutional neural network and the bidirectional gated unit integrated with multi-head attention mechanism to extract text feature information,and the extracted feature information is fused and sent to the full connection layer and Softmax function to complete short text classification.Multiple models comparison experiments are carried out on the publicly available THUCNews short text data set and SougouCS short text data set respectively.The experimental results show that the proposed Mengzi-ADCBU model is better than the current mainstream models in the accuracy,precision,recall rate and F1 value of short text classification and has better short text classification ability.

short textmulti-head attentiondeep pyramid convolutional neural netwrksbidirectional gated unit

陈雪松、李衡、王浩畅

展开 >

东北石油大学电气信息工程学院,黑龙江 大庆 163318

东北石油大学计算机与信息技术学院,黑龙江 大庆 163318

短文本 多头注意力 深度金字塔卷积神经网络 双向门控单元

国家自然科学基金资助项目国家自然科学基金资助项目

6140209961702093

2024

计算机与现代化
江西省计算机学会 江西省计算技术研究所

计算机与现代化

CSTPCD
影响因子:0.472
ISSN:1006-2475
年,卷(期):2024.(9)
  • 16