Contrastive Learning-based Prompt Generation Method for Large-scale Language Model Reverse Dictionary Task
The reverse dictionary task is an emerging task that aims to find the word corresponding to a given definition. Large-scale language models offer new possibilities for this task, but the quality of the prompt sentences affects their performance. To this end, this paper proposes a contrastive learning-based prompt generation method. The method extracts definition semantics at multiple semantic levels and enhances the model's generalization ability by incorporating negative examples through contrastive learning. With this method, the target word can be narrowed down to a small candidate range, from which a large model selects the word most semantically consistent with the definition. Experimental results show that the proposed method effectively improves the performance of large-scale language models on the reverse dictionary task: the prompt generation model generates a range containing the target word with 94.7% probability, and the large-scale language model directly selects the target word with 58.03% probability and includes the target word among five candidate words with 74.55% probability.
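The abstract does not spell out the training objective or the candidate-narrowing step in detail. As an illustration only, the two ideas it describes — a contrastive objective that pulls a definition embedding toward its target word while pushing it away from negative examples, and a retrieval step that narrows the vocabulary to a small candidate range for the large model to choose from — might be sketched as follows. The function names, the InfoNCE-style loss form, and the temperature value are assumptions, not the paper's actual implementation:

```python
import numpy as np

def info_nce_loss(def_emb, word_emb, temperature=0.07):
    """Contrastive (InfoNCE-style) loss over a batch of (definition, word) pairs.

    Row i of def_emb is the embedding of definition i; row i of word_emb is the
    embedding of its target word. The other words in the batch act as in-batch
    negative examples, as in the negative-sampling idea the abstract describes.
    """
    # L2-normalize so the dot product is cosine similarity
    d = def_emb / np.linalg.norm(def_emb, axis=1, keepdims=True)
    w = word_emb / np.linalg.norm(word_emb, axis=1, keepdims=True)
    logits = d @ w.T / temperature                     # (B, B) similarity matrix
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # Positives lie on the diagonal; minimizing the loss pulls each definition
    # toward its own word and away from the other words in the batch.
    return -float(np.mean(np.diag(log_probs)))

def top_k_candidates(def_emb, vocab_emb, k=5):
    """Narrow the whole vocabulary down to the k words closest to a definition.

    This models the abstract's two-stage setup: the small candidate range
    returned here would then be handed to the large language model, which
    picks the single word that best matches the definition.
    """
    d = def_emb / np.linalg.norm(def_emb)
    v = vocab_emb / np.linalg.norm(vocab_emb, axis=1, keepdims=True)
    sims = v @ d                       # cosine similarity of each word to the definition
    return np.argsort(-sims)[:k]       # indices of the k most similar words
```

In this sketch, a well-trained encoder would make `info_nce_loss` small on held-out (definition, word) pairs, and `top_k_candidates` with `k=5` corresponds to the five-candidate setting reported in the abstract.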

Reverse dictionary; Large-scale language model; Contrastive learning; Multiple semantic scales; Contrastive loss

TIAN Sicheng, HUANG Shaobin, WANG Rui, LI Rongsheng, DU Zhijuan


College of Computer Science and Technology, Harbin Engineering University, Harbin 150001, China

Engineering Research Center of Ecological Big Data, Ministry of Education, Inner Mongolia 010021, China

College of Computer Science, Inner Mongolia University, Inner Mongolia 010021, China


Funding: Open Project of the Engineering Research Center of Ecological Big Data, Ministry of Education

2024

Computer Science
Chongqing Southwest Information Co., Ltd. (formerly Southwest Information Center, Ministry of Science and Technology)


Indexed in: CSTPCD; Peking University Core Journals
Impact factor: 0.944
ISSN: 1002-137X
Year, Volume (Issue): 2024, 51(8)