电声技术2024,Vol.48Issue(9) :73-75.DOI:10.16311/j.audioe.2024.09.023

基于统计模型的人声识别优化研究

Research on Optimization of Voice Recognition Based on Statistical Models

晁松杰 娄艺
电声技术2024,Vol.48Issue(9) :73-75.DOI:10.16311/j.audioe.2024.09.023

基于统计模型的人声识别优化研究

Research on Optimization of Voice Recognition Based on Statistical Models

晁松杰 1娄艺1
扫码查看

作者信息

  • 1. 漯河职业技术学院,河南 漯河 462600
  • 折叠

摘要

为研究基于变分推断的高斯混合模型(Gaussian Mixture Model,GMM)在人声识别中的优化方法,首先设计人声识别系统框架,其次阐述传统GMM在人声识别系统中的基本原理和特点,再次详细介绍变分推断的基本原理及其在GMM优化中的应用,最后采用公开数据集进行实验评估.仿真结果表明,优化后的GMM在识别准确率、精确率、召回率以及F1分数等指标上均显著优于传统GMM.

Abstract

In order to study the optimization method of Gaussian Mixture Model (GMM) based on variational inference in human voice recognition,the framework of human voice recognition system is first designed,and the basic principle and characteristics of traditional GMM in human voice recognition systems are described. Then,the basic principle of variational inference and its application in GMM optimization are introduced in detail. Finally,experimental evaluation is carried out with open data set.The simulation results show that the optimized GMM is significantly better than the traditional GMM in recognition accuracy,precision,recall,and F1 score.

关键词

高斯混合模型(GMM)/人声识别/变分推断/统计模型

Key words

Gaussian Mixture Model (GMM)/voice recognition/variational inference/statistical model

引用本文复制引用

出版年

2024
电声技术
电视电声研究所(中国电子科技集团公司第三研究所)

电声技术

影响因子:0.259
ISSN:1002-8684
参考文献量7
段落导航相关论文