Microcomputer Applications (微型电脑应用), 2024, Vol. 40, Issue 2: 18-20, 25.

Research on Architectural Text Classification Method Based on Word Vector Fusion

Hu Shaoyun (胡少云) 1, Weng Qingxiong (翁清雄) 1

Author information

  • 1. School of Management, University of Science and Technology of China, Hefei, Anhui 230026, China

Abstract

Questions in the architectural domain contain complex and varied domain-specific terms, which makes them difficult for common text classification algorithms to classify. To improve classification performance on such questions, this paper proposes an architectural text classification algorithm based on the fusion of RoBERTa and Word2Vec. Experimental results show that the proposed method reaches an accuracy of 91.59% on the architectural-domain question dataset, indicating good classification performance, and that on general datasets its accuracy is higher than that of SVM, CNN, and other models.
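
The abstract does not state how the RoBERTa and Word2Vec representations are combined. As a minimal sketch of one common option, the snippet below assumes fusion by concatenating two L2-normalized sentence vectors. The toy vectors, the `mean_pool` and `fuse` helpers, and the concatenation strategy itself are illustrative assumptions, not the authors' method; in practice the contextual vector would come from a RoBERTa encoder and the static word vectors from a trained Word2Vec model.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length; an all-zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def mean_pool(tokens, word_vectors, dim):
    """Average the static word vectors of known tokens (unknown tokens are skipped)."""
    known = [word_vectors[t] for t in tokens if t in word_vectors]
    if not known:
        return [0.0] * dim
    return [sum(column) / len(known) for column in zip(*known)]


def fuse(contextual_vec, static_vec):
    """Concatenate two L2-normalized sentence vectors (assumed fusion strategy)."""
    return l2_normalize(contextual_vec) + l2_normalize(static_vec)


# Toy stand-ins: a real pipeline would load Word2Vec vectors and a RoBERTa encoder.
toy_word_vectors = {"concrete": [1.0, 0.0], "curing": [0.0, 1.0]}
toy_contextual_vec = [3.0, 0.0, 4.0]  # hypothetical sentence vector from an encoder

static_vec = mean_pool(["concrete", "curing", "time"], toy_word_vectors, dim=2)
fused = fuse(toy_contextual_vec, static_vec)
print(len(fused))
```

The fused vector would then be passed to a classifier. Normalizing each part before concatenation keeps either representation from dominating purely because of its scale, which is a design choice of this sketch rather than something the abstract specifies.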

Key words

text classification/pretrained language model/sentence vector/deep learning/question-answering system


Funding

National Natural Science Foundation of China, International (Regional) Cooperation and Exchange Program (7191001010)

Publication year

2024
Microcomputer Applications (微型电脑应用)
Shanghai Microcomputer Applications Society (上海市微型电脑应用学会)

CSTPCD
Impact factor: 0.359
ISSN: 1007-757X
Number of references: 7