基于MFCC和HMM的语音识别优化方法研究

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：为探究基于梅尔频率倒谱系数(Mel-Frequency Cepstral Coefficients,MFCC)和隐马尔可夫模型(Hidden Markov Model,HMM)的语音识别优化方法,首先探讨语音识别系统的基本框架设计,其次分析MFCC特征提取方法,再次引入期望最大化(Expectation Maximization,EM)算法优化HMM参数,最后利用THCHS-30 数据集进行实验验证.结果表明,引入EM算法优化HMM,可有效克服传统HMM在复杂语音环境下的识别困难问题,显著提升系统的识别精度和健壮性.

外文标题：Research on Speech Recognition Optimization Method Based on MFCC and HMM

外文摘要：In order to explore the speech recognition optimization method based on Mel-Frequency Cepstral Coefficients(MFCC)and Hidden Markov Model(HMM),the basic framework design of the speech recognition system is first discussed.Secondly,the MFCC feature extraction method is analyzed,and the Expectation Maximization(EM)algorithm is introduced again to optimize HMM parameters.Finally,the THCHS-30 dataset is used for experimental verification.The results show that the introduction of EM algorithm to optimize HMM can effectively overcome the recognition difficulties of traditional HMM model in complex speech environment,and significantly improve the recognition accuracy and robustness of the system.

外文关键词：

speech recognitionMel-Frequency Cepstral Coefficients(MFCC)Hidden Markov Model(HMM)Expectation Maximization(EM)

作者：

郭佳淇、张继通

展开 >

作者单位：

郑州工业应用技术学院,河南郑州 451100

关键词：

语音识别梅尔频率倒谱系数(MFCC) 隐马尔可夫模型(HMM) 期望最大化(EM)

出版年：

2024

DOI：