计算机工程与设计2024,Vol.45Issue(2) :587-593.DOI:10.16208/j.issn1000-7024.2024.02.034

基于幅度和相位混合特征交叉的语音增强方法

Speech enhancement method based on amplitude phase mixed feature intersection

卿朝进 付小伟 唐书海
计算机工程与设计2024,Vol.45Issue(2) :587-593.DOI:10.16208/j.issn1000-7024.2024.02.034

基于幅度和相位混合特征交叉的语音增强方法

Speech enhancement method based on amplitude phase mixed feature intersection

卿朝进 1付小伟 1唐书海1
扫码查看

作者信息

  • 1. 西华大学电气与电子信息学院,四川成都 610039
  • 折叠

摘要

为充分利用含噪语音信号的相位特征信息及其与幅度信息的相关性,提出一种幅度和相位混合特征交叉的单通道语音增强方法.提取含噪信号的对数功率谱和相位特征,依次交叉排列;计算复数掩模,将复数掩模的实虚部依次交叉以保持对称输入特征;在此基础上,构建深度编解码器网络(amplitude phase deep encoder decoder network,APDEDN)增强语音质量.实验结果表明,相较单一特征方法,提出方法获得了语音质量感知评估评分和短时目标可懂度上的改善.

Abstract

To make full use of the phase feature information from the noisy speech signal and its correlation with amplitude infor-mation,a mixed amplitude and phase features-based single channel speech enhancement method was presented.The logarithmic power spectrum and phase features of the noisy signals were extracted and arranged alternately.The complex mask was calculated,and the real and imaginary parts of the calculated complex mask were alternately arranged to maintain the features of input sym-metry.Based on these processing,an amplitude phase deep encoder decoder network(APDEDN)was constructed to enhance speech quality.Compared with the single feature method,experimental results show that the proposed method improves both the speech quality perception evaluation score and short time objective intelligibility.

关键词

语音增强/特征交叉/特征提取/混合特征/复数掩模/编解码器/深度学习

Key words

speech enhancement/feature intersection/feature extraction/mixed feature/complex ideal ratio mask/encoder-decoder/deep learning

引用本文复制引用

基金项目

四川省科技计划基金项目(2021JDRC0003)

四川省产业发展专项基金项目(ZYF-2018-056)

四川省科技计划项目重大科技专项基金项目(19ZDZX0016/2019YFG0395)

2020年成都市第二批重大科技应用示范基金项目(2020-YF09-00048-SN)

出版年

2024
计算机工程与设计
中国航天科工集团二院706所

计算机工程与设计

CSTPCD北大核心
影响因子:0.617
ISSN:1000-7024
参考文献量16
段落导航相关论文