Robotics & Machine Learning Daily News2024,Issue(Feb.26) :60-61.DOI:10.1007/s11390-023-1302-6

Researchers from Southeast University Report Findings in Machine Translation (Visual Topic Semantic Enhanced Machine Translation for Multi-modal Data Efficiency)

Robotics & Machine Learning Daily News2024,Issue(Feb.26) :60-61.DOI:10.1007/s11390-023-1302-6

Researchers from Southeast University Report Findings in Machine Translation (Visual Topic Semantic Enhanced Machine Translation for Multi-modal Data Efficiency)

扫码查看

Abstract

Investigators publish new report on Machine Translation. According to news originating from Nanjing, People’s Republic of China, by NewsRx correspondents, research stated, “The scarcity of bilingual parallel corpus imposes limitations on exploiting the state-of-the-art supervised translation technology. One of the research directions is employing relations among multi-modal data to enhance performance.” Financial support for this research came from National Natural Science Foundation of China (NSFC). Our news journalists obtained a quote from the research from Southeast University, “However, the reliance on manually annotated multi-modal datasets results in a high cost of data labeling. In this paper, the topic semantics of images is proposed to alleviate the above problem. First, topic-related images can be automatically collected from the Internet by search engines. Second, topic semantics is sufficient to encode the relations between multi-modal data such as texts and images. Specifically, we propose a visual topic semantic enhanced translation (VTSE) model that utilizes topic-related images to construct a cross-lingual and cross-modal semantic space, allowing the VTSE model to simultaneously integrate the syntactic structure and semantic features. In the above process, topic similar texts and images are wrapped into groups so that the model can extract more robust topic semantics from a set of similar images and then further optimize the feature integration. The results show that our model outperforms competitive baselines by a large margin on the Multi30k and the Ambiguous COCO datasets.”

Key words

Nanjing/People’s Republic of China/Asia/Emerging Technologies/Machine Learning/Machine Translation/Southeast University

引用本文复制引用

出版年

2024
Robotics & Machine Learning Daily News

Robotics & Machine Learning Daily News

ISSN:
参考文献量50
段落导航相关论文