融合目标端上下文的篇章神经机器翻译

Modeling Target-side Context for Document-level Neural Machine Translation

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：神经机器翻译在句子级翻译任务上取得了令人瞩目的效果,但是句子级翻译的译文会存在一致性、指代等篇章问题,篇章翻译通过利用上下文信息来解决上述问题.不同于以往使用源端上下文建模的方法,该文提出了融合目标端上下文信息的篇章神经机器翻译.具体地,该文借助推敲网络的思想,对篇章源端进行二次翻译,第一次基于句子级翻译,第二次翻译参考了全篇的第一次翻译结果.基于 LDC 中英篇章数据集和 WMT 英德篇章数据集的实验结果表明,在引入较少的参数的条件下,该文方法能显著提高翻译性能.同时,随着第一次翻译(即句子级译文)质量的提升,所提方法也更有效.

外文摘要：Recently neural machine translation(NMT)has achieved remarkable success in sentence-level translation.However,it still cannot resolve a wide variety of discourse phenomena,such as lexical cohesion and coreference,which can be alleviated by using context information in document-level translation.In contrast to existing studies of modeling source-side context,this paper proposes to model target-side context in document-level NMT.Specifically,motivated by deliberation networks,our approach translates source-side document twice.In the first-pass transla-tion,it performs sentence-level translation.In the second-pass,it properly translates each sentence by modeling the target-side context which has just be generated from the first-pass translation.Experimental results on LDC Chinese-to-English and WMT English-to-German document-level translation tasks show that our approach significantly im-proves translation performance by introducing few parameters.Meanwhile,it is observed that the proposed approach benefits more if the performance of the first-pass translation(i.e.,sentence-level NMT)is improved.

外文关键词：

neural machine translationdeliberation networksdocument-level translation

作者：

贾爱鑫、李军辉、贡正仙、张民

展开 >

作者单位：

苏州大学计算机科学与技术学院,江苏苏州 215006

关键词：

神经机器翻译推敲网络篇章翻译

基金：

国家自然科学基金国家自然科学基金

项目编号：

6187612061976148

出版年：

2024

中文信息学报

中国中文信息学会,中国科学院软件研究所

中文信息学报

CSTPCDCHSSCD北大核心

影响因子：0.8

ISSN：1003-0077

年,卷(期)：2024.38(4)

参考文献量27