大语言模型的汉语框架语义分析能力评估

扫码查看

原文链接

国家科技期刊平台
NETL
NSTL
万方数据

中文摘要：大语言模型的出现对自然语言处理产生了广泛的影响,已有研究表明大语言模型在各类下游任务中具有出色的Zero-shot及Few-shot能力,而对于大语言模型的语义分析能力的评估仍然比较缺乏.因此,本文基于汉语框架语义分析中的三个子任务:框架识别、论元范围识别和论元角色识别,分别在Zero-shot及Few-shot设定下评估了ChatGPT、Gemini和ChatGLM三个大语言模型在CFN2.0数据集上的语义分析能力,并与目前基于BERT(Bi-directional Encoder Representations from Transformers)的SOTA模型进行了比较.在框架识别任务中,大语言模型的准确率仅比SOTA模型低0.04;但在论元范围识别与论元角色识别任务上,大语言模型表现不佳,与SOTA(State of the Art)模型相比,F1分数分别相差0.13和0.39.以上结果表明,大语言模型虽具备一定的框架语义分析能力,但进一步提升大语言模型的语义分析能力仍然是一个具有挑战性的工作.

外文标题：Evaluation of Chinese Frame Semantic Analysis Capabilities of Large Language Models

外文摘要：The emergence of large language models(LLMs)has a widespread impact on natural language processing.Studies have shown that the LLMs have excellent Zero-shot and Few-shot capabilities in various downstream tasks,but the evaluation of the se-mantic analysis capabilities of the LLMs is still lacking.Therefore,based on three subtasks in Chinese frame semantic analysis:frame identification,argument identification,and role identification,this paper evaluates the semantic analysis capabilities of three LLMs,namely ChatGPT,Gemini,and ChatGLM,on the CFN2.0 dataset under Zero-shot and Few-shot settings,and compares them with the current BERT-based SOTA model.In the frame identification task,the accuracy of the LLMs is only 0.04 lower than that of the SOTA model.However,in the argument identification and role identification task,the performance of the LLMs is suboptimal,with F1 scores differing by 0.13 and 0.39,respectively compared to the SOTA model.The above results show that although the LLMs have certain frame semantic analysis capabilities,further improving the semantic analysis capabilities of LLMs is still a challenging work.

外文关键词：

large language modelframe identificationargument identificationrole identification

作者：

高俊杰、马博翔、闫智超、苏雪峰、李茹

展开 >

作者单位：

山西大学计算机与信息技术学院,山西太原 030006

山西工程科技职业大学现代物流学院,山西晋中 030609

计算智能与中文信息处理教育部重点实验室,山西太原 030006

关键词：

大语言模型框架识别论元范围识别论元角色识别

基金：

山西省科技合作交流专项项目山西省基础研究计划项目国家自然科学基金重点项目

项目编号：

20220404110101620220302121128661936012

出版年：

2024

DOI：

10.13451/j.sxu.ns.2014112

山西大学学报(自然科学版)

山西大学

山西大学学报(自然科学版)

CSTPCD北大核心

影响因子：0.287

ISSN：0253-2395

年,卷(期)：2024.47(5)