预训练神经机器翻译研究进展分析

扫码查看

原文链接

万方数据
维普

中文摘要：神经机器翻译(NMT)模型通常使用双语数据进行监督训练,而构建大规模双语数据集是一个巨大挑战.相比之下,大部分语言的单语数据集较为容易获取.近年来,预训练模型(PTM)能够在海量的单语数据上进行训练,从而得到通用表示知识,来帮助下游任务取得显著的性能提升.目前基于预训练的神经机器翻译(PTNM T)在受限资源数据集上已被广泛验证,但如何高效地在高资源NMT模型中利用PTM仍亟待研究.该文致力于对PTNMT的现状和相关问题进行系统性的整理和分析,从引入PTM的预训练方法、使用策略以及特定任务等角度对PTNMT方法进行详细的分类,并对PTNMT方法解决的问题进行总结,最后对PTNMT的研究进行展望.

外文标题：Pre-trained Neural Machine Translation:Progress and Analysis

外文摘要：Neural machine translation(NMT)models are usually trained using bilingual data.Building large-scale bi-lingual datasets is a huge challenge.In contrast,large-scale monolingual datasets for most languages are easier to construct.Pre-trained models(PTM)proposed in recent years can be trained on massive monolingual data.The ge-neric representation of knowledge learned through pre-training helps achieve significant performance gains in down-stream tasks.Currently pre-trained neural machine translation(PTNMT)has been extensively validated on resource-constrained datasets,but how to efficiently utilize PTM on high-resource NMT remains to be discussed.This paper focuses on reviewing and analyzing the current state and related problems of PTNMT,classifing PTNMT methods in terms of PTM's pre-trained methods,strategies,or specific tasks.We summarize the problems solved by PTNMT's methods,and conclude with a future outlook on PTNMT research.

外文关键词：

natural language processingpre-trained modelneural machine translation

作者：

曹智泉、穆永誉、肖桐、李北、张春良、朱靖波

展开 >

作者单位：

东北大学自然语言处理实验室,辽宁沈阳 110819

关键词：

自然语言处理预训练模型神经机器翻译

基金：

国家自然科学基金科技部科技创新2030—"新一代人工智能"重大项目云南省科技厅科技计划项目中央高校基本科研业务费项目中央高校基本科研业务费项目中央高校基本科研业务费项目111引智基地

项目编号：

622760562020AAA0107904202103AA080015N2216016N2216001N2216002B16009

出版年：

2024

中文信息学报

中国中文信息学会,中国科学院软件研究所

中文信息学报

CSTPCDCHSSCD北大核心

影响因子：0.8

ISSN：1003-0077

年,卷(期)：2024.38(6)