首页|基于TextCNN-Attention-BiLSTM融合模型的煤矿隐患文本分类研究

基于TextCNN-Attention-BiLSTM融合模型的煤矿隐患文本分类研究

扫码查看
为实现大量煤矿隐患文本的迅速、精确分类,及时了解安全概况并加以管理.首先,选取安全文库网中多个煤矿隐患数据库为实验数据源,对煤矿隐患文本进行预处理,包括去除噪声词、分词和词向量表示等;其次,利用TextCNN对文本进行卷积操作,提取不同尺寸的特征表示,再利用BiLSTM模型对得到的特征向量进行时序建模,并结合注意力机制(Attention),从而更好地关注文本中关键信息,捕捉文本全局语义信息;最后,利用全连接层的多标签分类器预测文本隐患类别.实验结果表明:TextCNN-Attention-BiLSTM融合模型在准确率、精确率、召回率和F1 值上均达到92%以上,为煤矿隐患文本分类提供了一种更加准确和有效的解决方案,对煤矿安全管理优化具有重要意义.
Research on Classification of Coal Mine Hidden Danger Texts Based on TextCNN-Attention-BiLSTM Fusion Model
In order to achieve quick and accurate classification of a large number of coal mine hazard texts,and timely un-derstand the safety situation for effective management,first,multiple coal mine hidden danger databases from the website of safety library were selected as experimental data sources,the coal mine hidden danger texts were preprocessed,including noise words removal,word segmentation,and word vector representation,etc.Then,the TextCNN(text convolutional neural network)was used to perform convolution operations on the texts,extracting feature representations of different sizes.The BiLSTM(bi-direc-tional long short-term memory)model was utilized to sequentially model the obtained feature vectors.Combined with the atten-tion mechanism(Attention),the model can better focus on key information in the texts and capture the global semantic informa-tion of the texts.Finally,a multi-label classifier in the fully connected layer was used to predict the categories of hidden dan-gers in the texts.Experimental results showed that the fused TextCNN-Attention-BiLSTM model achieves over 92%accuracy,precision,recall,and F1 value,providing a more accurate and effective solution for coal mine hidden danger text classification.It is of great significance for optimizing coal mine safety management.

coal mine safetyTextCNNattention mechanismBiLSTMtext classification

罗海平、曾向阳、陈勇

展开 >

武汉科技大学 资源与环境工程学院,湖北 武汉 430081

煤矿安全 TextCNN 注意力机制 BiLSTM 文本分类

国家自然科学基金项目

41971237

2024

武汉理工大学学报(信息与管理工程版)
武汉理工大学

武汉理工大学学报(信息与管理工程版)

CSTPCD
影响因子:0.37
ISSN:2095-3852
年,卷(期):2024.46(2)
  • 18