基于循环神经网络的双麦克风语音增强算法

扫码查看

原文链接

万方数据
维普

中文摘要：针对基于神经网络的语音增强算法难以部署在助听器中的问题,基于循环神经网络,提出了一种低延迟、低复杂度的双麦克风语音增强算法.该算法利用两个麦克风做空域滤波初步去除非期望方向噪声,并进一步通过循环神经网络得到纯净语音信号.为了解决助听器中全相位滤波器组阶数较多而引起群延迟较大的问题,创新性地提出一种分段式滤波器组,在保证性能的同时有效减少了阶数.仿真结果显示,该滤波器组在16 k采样率下的群延迟为3.125 ms,在0 dB的babble、volvo、factory1环境下,该语音增强算法的SNR平均提升了10.5565 dB,PESQ平均提升了0.6787.实际测试结果中,SNR平均提升了9.4394 dB,PESQ平均提升了0.7350.当DSP系统时钟频率为104 MHz时,助听器经过的系统延迟为8.4 ms,功耗为6.2 mA,可以很好满足助听器高续航的需求.

外文标题：Speech Enhancement with Two Microphones Based on Recurrent Neural Network

外文摘要：Large-scale neural networks are difficult to deploy in hearing aids. A dual microphone speech enhancement algorithm based on recurrent neural networks( RNN) is proposed,which has the advantages of low latency and low complexity. The algorithm uses two micro-phones for beamforming to preliminarily filter out the unexpected directional noise,and further obtains the pure voice signal through RNN. In order to solve the problem of large group delay caused by excessive order of filter banks,a piecewise all phase filter bank is proposed, which can reduce the group delay and computation. The simulation results show that the group delay is 3. 125 ms under 16 k sampling rate,and under the 0 dB babble,volvo,and factory1 environment,the SNR has increased by 10.5565 dB,and the PESQ has increased by 0.6787. In the actual test results,the SNR has increased by 9.4394 dB,and the PESQ has increased by 0.7350. When the clock frequen-cy is 104 MHz,the DSP system delay of the hearing aid is 8.4 ms and the power consumption is 6.2 mA.

外文关键词：

speech enhancementfilterbankrecurrent neural networkhearing aidsDSP implementation

作者：

邱智乾、陈霏、郎标

展开 >

作者单位：

天津大学微电子学院,天津市成像与感知微电子技术重点实验室,天津300072

深圳清华大学研究院,广东深圳518057

关键词：

语音增强滤波器组循环神经网络助听器,DSP实现

基金：

国家重点研发计划深圳市科技计划深圳市科技计划深圳市科技计划

项目编号：

2016YFA0202201JSGG20191129141019090JCYJ20210324115610028JSGG20210713091808027

出版年：

2024

DOI：

10.3969/j.issn.1004-1699.2024.03.009

传感技术学报

东南大学中国微米纳米技术学会

传感技术学报

CSTPCD北大核心

影响因子：1.276

ISSN：1004-1699

年,卷(期)：2024.37(3)

参考文献量5