Automatic Essay Scoring Method Based on Topic Perception and Semantic Enhancement

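Chinese abstract: Automatic essay scoring (AES) is an important research direction for applying natural language processing (NLP) technology in education; it aims to improve scoring efficiency and to make evaluation more objective and reliable. To address missing topic relevance and information loss in long texts, and drawing on the fact that different layers of the pre-trained language model BERT extract features of different dimensions, this paper proposes an AES model based on topic perception and semantic enhancement. The model uses a multi-head attention mechanism to extract shallow semantic features of an essay and to perceive its topic features, and it uses BERT's intermediate-layer syntactic features and deep-layer semantic features to strengthen its understanding of the essay's semantics. On this basis, features of different dimensions are fused and used for automatic essay scoring. Experimental results show that the model achieves significant performance advantages on all 8 subsets of the public ASAP dataset; compared with baseline models such as Tongyi Qianwen (Qwen), it effectively improves AES performance, reaching an average quadratic weighted kappa (QWK) of 80.25%.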

English title: Automatic Essay Scoring Method Based on Topic Perception and Semantic Enhancement

English abstract: Automatic Essay Scoring (AES) is an important research topic in the application of Natural Language Processing (NLP) technology to education. AES aims to improve scoring efficiency and enhance the objectivity and reliability of evaluations. This study proposes a topic perception and semantic enhancement approach for AES that addresses missing thematic relevance and information loss in long texts, and that leverages the ability of different layers of the pre-trained language model Bidirectional Encoder Representations from Transformers (BERT) to extract features at different levels. The approach uses a multi-head attention mechanism to extract shallow semantic features of an essay and perceive its thematic characteristics. Additionally, it uses the mid-level syntactic and deep semantic features of BERT to enhance its understanding of the essay's semantics. Finally, the fused features from different dimensions are used for AES. Experimental results indicate that the proposed model exhibits significant performance advantages on all eight subsets of the public ASAP dataset. Compared with baseline models such as Qwen-7B, the proposed model effectively improves AES performance; its average Quadratic Weighted Kappa (QWK) is 80.25%.
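The abstract reports its results as quadratic weighted kappa (QWK), the agreement measure between predicted and human scores. A minimal pure-Python sketch of the standard QWK definition follows; the function name `quadratic_weighted_kappa`, its parameters, and the toy scores in the usage note are illustrative assumptions, not the authors' evaluation code. The function returns a value of at most 1, so the reported 80.25% corresponds to 0.8025 on this scale.

```python
from collections import Counter


def quadratic_weighted_kappa(rater_a, rater_b, min_rating, max_rating):
    """Quadratic weighted kappa between two equal-length lists of integer scores.

    Hypothetical helper for illustration; scores are assumed to lie in
    [min_rating, max_rating].
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("score lists must be non-empty and of equal length")
    n_ratings = max_rating - min_rating + 1
    if n_ratings < 2:
        raise ValueError("the rating scale needs at least two distinct values")
    n = len(rater_a)

    # Observed joint counts and per-rater marginal histograms.
    observed = Counter(zip(rater_a, rater_b))
    hist_a = Counter(rater_a)
    hist_b = Counter(rater_b)

    numerator = 0.0    # weighted observed disagreement
    denominator = 0.0  # weighted disagreement expected by chance
    for i in range(min_rating, max_rating + 1):
        for j in range(min_rating, max_rating + 1):
            # Quadratic penalty: grows with the squared score distance.
            weight = (i - j) ** 2 / (n_ratings - 1) ** 2
            expected = hist_a[i] * hist_b[j] / n
            numerator += weight * observed[(i, j)]
            denominator += weight * expected

    if denominator == 0.0:
        # Both raters gave the same single score to every essay.
        return 1.0
    return 1.0 - numerator / denominator
```

For example, `quadratic_weighted_kappa([1, 2, 3, 4], [1, 2, 3, 4], 1, 4)` returns 1.0 (perfect agreement), while fully reversed scores on the same scale return -1.0. Averaging this value over the eight ASAP prompts would give a single summary number of the kind quoted in the abstract.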

English keywords:

Automatic Essay Scoring (AES); semantic enhancement; topic perception; feature fusion; pre-training language model

Authors:

陈宇航、杨勇、先木斯亚·买买提明、帕力旦·吐尔逊、樊小超、任鸽、刁宇峰

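Author affiliations:

School of Computer Science and Technology, Xinjiang Normal University, Urumqi, Xinjiang 830054

School of Mathematics and Information, Hotan Normal College, Hotan, Xinjiang 848000

College of Computer Science and Technology, Inner Mongolia Minzu University, Tongliao, Inner Mongolia 028000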

Keywords:

automatic essay scoring; semantic enhancement; topic perception; feature fusion; pre-trained language model

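Funding:

Natural Science Foundation of Xinjiang Uygur Autonomous Region; National Natural Science Foundation of China; National Natural Science Foundation of China; National Natural Science Foundation of China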

Grant numbers:

2021D01B72; 62066044; 62167008; 62006130

Publication year:

2024

DOI:

10.19678/j.issn.1000-3428.0068333

Computer Engineering

East China Institute of Computing Technology; Shanghai Computer Society

Indexed in: CSTPCD; Peking University Core Journals

Impact factor: 0.581

ISSN: 1000-3428

Year, volume (issue): 2024, 50(8)

Citations: 1
References: 5