大语言模型多语言词对齐能力评测方法

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：针对目前大语言模型多语言词对齐能力评测相对缺乏的问题,提出一种通过跨语言选词填空任务评测其多语言词对齐能力的方法:根据词汇的上下文长短、词性、干扰词数量等规则生成多维度的跨语言选词填空评测数据集,并用其对多种大语言模型的多语言词对齐能力进行评测.在以中文和英文两种语言以及代表性大语言模型为例的实验结果显示,多个大语言模型在中英跨语言选词填空任务上的准确率超过80％,最高达90.4％,证明大语言模型具有较优的多语言词对齐能力.不仅可为评测大语言模型多语言词对齐能力提供方法和测试数据,也可为多语言共同性和跨语言处理任务研究者提供模型选择建议.

外文标题：Evaluation methods for multilingual word alignment capabilities in large language models

外文摘要：To solve the problem that the relative lack of current evaluations on the multilingual word alignment capabilities of Large Language Models,this paper introduced a method to evaluate these capabilities through a cross-lingual cloze.The method involved generating a multidimensional evaluation datasets based on rules such as the context length,part of speech,and number of distractor words of the vocabulary.This datasets were used to test the multilingual word alignment abilities of various Large Language Models.Experimental results,using representative Large Language Models and focusing on Chinese and English,dem-onstrated that these models achieved an accuracy rate of over 80％on cross-lingual cloze tasks,with the highest reaching 90.4％.This performance confirmed the strong multilingual alignment capabilities of Large Language Models.The study not only offered a methodology and data for evaluating multilingual word alignment capabilities but also provided model selection recom-mendations for researchers in multilingual universality and cross-lingual processing tasks.

外文关键词：

large language modelmultilingual word alignment ability evaluationcross-lingual cloze

作者：

李洁、李正芳、邹垚、熊大卫、胡建

展开 >

作者单位：

西南民族大学计算机与人工智能学院,四川成都 610041

关键词：

大语言模型多语言词对齐能力评测跨语言选词填空

出版年：

2024

DOI：

10.11920/xnmdzk.2024.06.012

西南民族大学学报(自然科学版)

西南民族大学

西南民族大学学报(自然科学版)

CSTPCD

影响因子：0.441

ISSN：2095-4271

年,卷(期)：2024.50(6)