首页|基于自适应门限融合策略的语音去噪算法

基于自适应门限融合策略的语音去噪算法

扫码查看
针对单个语音去噪算法在去噪过程中关注点较为单一,而多个语音去噪算法在融合时存在细节信息被削弱、融合效果不理想的问题,提出一种多个语音去噪算法下的自适应门限融合策略,将带噪信号分别经过3种不同的去噪算法得到3个去噪信号;根据自适应门限值以帧为单位进行帧筛选,得到自适应门限融合策略下的去噪信号;为提高识别效果,采用倒谱提升器对Gammatone频率倒谱系数(Gammatone Frequency Cepstrum Coefficient,GFCC)进行改进,并联合支持向量机进行噪声环境下的语音识别.实验结果表明,在5、10、15、20 dB四种信噪比下,通过该融合策略所得到的去噪信号与目前主流的顺序融合及多级融合方式相比,在语音识别率方面平均提高3.6%,融合倒谱提升器的GFCC特征相比于GFCC特征平均提高了 2.2%.
Speech Denoising Algorithm Based on Adaptive Threshold Fusion Strategy
To solve the problem that a single speech denoising algorithm has a single focus in the process of denoising,while the details of multiple speech denoising algorithms are weakened and the fusion effect is not ideal,an adaptive threshold fusion strategy under multiple speech denoising algorithms is proposed.Three denoising signals are obtained by using three different denoising algorithms.Frame filtering is carried out according to the adaptive threshold,and then the denoising signal under the adaptive threshold fusion strategy is obtained.On the other hand,in order to improve the recognition effect,the Gammatone Frequency Cepstrum Coefficient(GFCC)is improved by using a cepstrum lifter,and the speech recognition in noisy environment is carried out jointly with support vector machine.The experimental results show that under the four signal to noise ratios of 5,10,15 and 20 dB,the speech recognition rate of the denoised signals obtained by the fusion strategy is improved by 3.6%on average compared with the current mainstream sequential fusion and multistage fusion methods,and the GFCC feature using cepstrum lifter is improved by 2.2%on average compared with the GFCC feature.

speech denoisingadaptive threshold fusionnoisy speechframe filteringGammatone filter

薛珮芸、师晨康、白静、赵建星、汪思斌

展开 >

太原理工大学电子信息与光学工程学院,山西晋中 030600

山西高等创新研究院博士后科研工作站,山西太原 030002

语音去噪 自适应门限融合 带噪语音 帧筛选 Gammatone滤波器

山西省应用基础研究计划山西省基础研究计划

201901D11109420210302124544

2024

无线电工程
中国电子科技集团公司第五十四研究所

无线电工程

影响因子:0.667
ISSN:1003-3106
年,卷(期):2024.54(4)
  • 18