Automation & Instrumentation (自动化与仪器仪表), 2024, Issue (6): 179-183. DOI: 10.14016/j.cnki.1001-9227.2024.06.179

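Research on Sentence-Pattern Consistency in Machine Translation Combining a Clustering Algorithm and an Improved Particle Swarm Algorithm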

The consistency study of fusion clustering algorithm and improved particle swarm algorithm

Lei Hongyou (雷宏友)

Author information

  • 1. Xianyang Normal University, Xianyang, Shaanxi 712000

Abstract

To address the differences in sentence patterns between Chinese and English in machine translation, this study constructs a fitness function to measure sentence consistency. An improved particle swarm algorithm (Improved Particle Swarm Optimization, IPSO) and an improved clustering algorithm (Improved K-means algorithm, IK-Means) are introduced to solve the fitness function and thereby improve the sentence-pattern consistency of Chinese-English translation. The results show that IPSO achieves clearly better BLEU (Bilingual Evaluation Understudy) scores than the other algorithms on different datasets. On the Chinese-English comparable corpus, IPSO reaches a maximum BLEU score of 23.11 and an average of 21.28, which are 8.97 and 11.28 higher, respectively, than the basic PSO algorithm. On the China English-Chinese parallel corpus, IPSO reaches a maximum BLEU score of 20.81 and an average of 18.79. These results indicate that the machine translation method combining IK-Means and IPSO delivers strong translation performance, improves the sentence-pattern consistency of Chinese-English translation, and provides a reliable methodological reference for improving machine translation quality.
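For readers unfamiliar with the baseline mentioned in the abstract, the following is a minimal sketch of a basic PSO loop. It is not the paper's IPSO: the abstract does not describe the improvements or the fitness function, so the `sphere` function is a hypothetical stand-in, the search is framed as minimization by assumption, and all parameter values (`w`, `c1`, `c2`, bounds, swarm size) are illustrative choices.

```python
import random


def sphere(x):
    """Hypothetical stand-in fitness (lower is better); NOT the paper's consistency measure."""
    return sum(v * v for v in x)


def pso(fitness, dim=2, n_particles=20, iters=100,
        w=0.7, c1=1.5, c2=1.5, lo=-5.0, hi=5.0, seed=0):
    """Basic PSO with a fixed inertia weight; returns (best_position, best_value)."""
    rng = random.Random(seed)
    # Random initial positions inside the box [lo, hi]^dim, zero initial velocities.
    pos = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    # Personal bests start at the initial positions.
    pbest = [p[:] for p in pos]
    pbest_val = [fitness(p) for p in pos]
    # Global best is the best personal best.
    g = min(range(n_particles), key=lambda i: pbest_val[i])
    gbest, gbest_val = pbest[g][:], pbest_val[g]

    for _ in range(iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Inertia + pull toward personal best + pull toward global best.
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                # Move and clamp to the search bounds.
                pos[i][d] = min(hi, max(lo, pos[i][d] + vel[i][d]))
            val = fitness(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
    return gbest, gbest_val


best, best_val = pso(sphere)
```

In a translation setting the position vector would presumably encode candidate translation parameters and the fitness would score sentence consistency, but that mapping is not given in the abstract and is left open here.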

Key words

clustering algorithm/improved particle swarm algorithm/machine translation/sentence pattern consistency


Funding

Shaanxi Provincial Education Science "14th Five-Year Plan" Project (SGH21Y0204)

Publication year

2024
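Automation & Instrumentation (自动化与仪器仪表)
Chongqing Industrial Automation Instrumentation Research Institute; Chongqing Society of Automation and Instrumentation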


CSTPCD
Impact factor: 0.327
ISSN: 1001-9227
Number of references: 12