首页|融合音素的缅甸语语音识别文本纠错

融合音素的缅甸语语音识别文本纠错

扫码查看
缅甸语语音识别文本中包含大量的同音和空格错误,使用通用的文本语义信息纠正错误字符,对缅甸语空格和同音错误定位和纠正不准确。考虑到缅甸语是一种声调语言,并且音素中包含了声调信息,因此提出融合音素的缅甸语语音识别文本纠错方法。通过参数共享策略对转录文本及其音素进行联合建模,利用音素信息辅助检测并纠正缅甸语同音和空格错误。实验结果表明,本文所提方法相比基线方法ConvSeq2Seq,在缅甸语语音识别纠错任务中的F1值提升了85。97%,达到了79。15%。
Text error correction of Burmese speech recognition based on phoneme fusion
The Burmese language speech recognition text contains a large number of homophones and space errors.General methods use text semantic information to correct erroneous characters,but they are not accurate in locating and correcting Burmese space and homophone errors.Considering that Bur-mese is a tonal language with tone information embedded within its phonemes,this paper proposes a method for correcting errors in Burmese language speech recognition text that incorporates phonemes.Parameter sharing strategy is used to jointly model the transcribed texts and theirs phonemes,phoneme information is used to assist in detecting and correcting Burmese homophones and space errors.Experi-mental results show that compared with ConvSeq2Seq method,the F1 value of the proposed method in the Burmese speech recognition correction task has increased by 85.97%,reaching 79.15%.

Burmese languagespeech recognition text correctionphonemeshared parameterbidi-rectional encoder representations from transformers(BERT)

陈璐、董凌、王文君、王剑、余正涛、高盛祥

展开 >

昆明理工大学信息工程与自动化学院,云南 昆明 650500

昆明理工大学云南省人工智能重点实验室,云南 昆明 650500

缅甸语 语音识别文本纠错 音素 共享参数 BERT

国家自然科学基金国家自然科学基金云南省高新技术产业发展项目云南省科技重大专项云南省科技重大专项云南省基础研究计划云南省学术和技术带头人后备人才项目

U21B202761972186201606202103AA080015202302AD080003202001AS070014202105AC160018

2024

计算机工程与科学
国防科学技术大学计算机学院

计算机工程与科学

CSTPCD北大核心
影响因子:0.787
ISSN:1007-130X
年,卷(期):2024.46(6)