Computer Applications and Software (计算机应用与软件), 2024, Vol. 41, Issue 2: 130-137. DOI: 10.3969/j.issn.1000-386x.2024.02.019

Speech Enhancement Based on a Deep Fully Convolutional Neural Elastic Network WCGAN-GP Model

Xu Wenting (许雯婷)¹, Gong Xiaofeng (龚晓峰)¹

Author information

  • 1. College of Electrical Engineering, Sichuan University, Chengdu 610065, Sichuan, China

Abstract

The Wasserstein generative adversarial network (WGAN) model [1] is widely used in speech enhancement, but it suffers from problems such as exploding gradients and unstable performance. This paper introduces a gradient penalty (GP) and elastic-network conditional constraints, restructures the generator and the discriminator as deep fully convolutional neural networks (DFCNN), and proposes a DFCNN-based elastic-network Wasserstein conditional generative adversarial network with gradient penalty (WCGAN-GP). The improved model satisfies the true Lipschitz constraint, improves controllability, stability and feature extraction capability, and trains faster. In the experiments, the improved model and WGAN were each used to enhance speech under different noise conditions, and the results confirm the superiority of the improved model for speech enhancement.
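The abstract names two regularizers without giving their formulas. As a rough illustration only, the sketch below uses the standard WGAN-GP penalty, lam * mean((||grad D(x_hat)||_2 - 1)^2), evaluated at random interpolates x_hat between real and generated samples, and a generic elastic-net term l1 * ||a||_1 + l2 * ||a||_2^2. Everything here is an assumption rather than the paper's method: the toy critic `D(x) = v . tanh(W x)` stands in for the DFCNN discriminator, the conditional input is omitted, and the function names and default coefficients (`lam`, `l1`, `l2`) are hypothetical.

```python
import numpy as np

def critic(x, W, v):
    """Toy critic D(x) = v . tanh(W x); a stand-in, not the paper's DFCNN."""
    return np.tanh(x @ W.T) @ v

def critic_input_grad(x, W, v):
    """Analytic gradient of the toy critic with respect to each input row."""
    h = np.tanh(x @ W.T)               # shape (batch, hidden)
    return ((1.0 - h ** 2) * v) @ W    # shape (batch, dim)

def gradient_penalty(real, fake, W, v, lam=10.0, rng=None):
    """Standard WGAN-GP term: push the critic's gradient norm towards 1."""
    rng = np.random.default_rng() if rng is None else rng
    eps = rng.uniform(size=(real.shape[0], 1))
    x_hat = eps * real + (1.0 - eps) * fake   # random interpolates
    grad_norm = np.linalg.norm(critic_input_grad(x_hat, W, v), axis=1)
    return lam * np.mean((grad_norm - 1.0) ** 2)

def elastic_net(a, l1=1e-3, l2=1e-3):
    """Generic elastic-net penalty: weighted L1 norm plus squared L2 norm."""
    return l1 * np.sum(np.abs(a)) + l2 * np.sum(a ** 2)

def critic_loss(real, fake, W, v, rng=None):
    """Wasserstein critic loss plus gradient penalty (assumed combination)."""
    wasserstein = np.mean(critic(fake, W, v)) - np.mean(critic(real, W, v))
    return wasserstein + gradient_penalty(real, fake, W, v, rng=rng)
```

The abstract does not say what the elastic-net constraint is applied to, so `elastic_net` takes an arbitrary array; it could be evaluated on network weights or on the residual between enhanced and clean speech. In a real model the input gradient would come from automatic differentiation rather than a hand-derived formula.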

Key words

Wasserstein distance / Deep fully convolutional neural networks / Gradient penalty / Elastic networks / Conditional constraints

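Funding

Sichuan Province Key Research and Development Program (2020YFG0051)

National Natural Science Foundation of China (61876114)

University-enterprise cooperation project (19H1121)

University-enterprise cooperation project (19H0355)

Year of publication

2024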
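Computer Applications and Software (计算机应用与软件)
Shanghai Institute of Computing Technology; Shanghai Development Center of Computer Software Technology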

Indexed in: CSTPCD, Peking University Core Journals
Impact factor: 0.615
ISSN: 1000-386X
Number of references: 2