首页|基于神经网络加速器的FPGA语音情感识别系统

基于神经网络加速器的FPGA语音情感识别系统

扫码查看
针对现有语音情感识别系统的部署功耗高、不具有便携性的缺点,提出一种基于神经网络加速器的FPGA语音情感识别系统设计.在FPGA上实现语音MFCC(Mel Frequency Cepstrum Coefficient)特征的提取,便于进行识别;为神经网络加速器设计指令生成算法,将网络模型部署在神经网络加速器实现语音情感识别.整个系统主要硬件资源消耗为37 078个LUT和153个DSP,支持在主流FPGA平台上的部署.经过检验,语音情感识别系统的指令运算误差可达0.06以下,输出误差为0.0004以下,满足语音情感识别的需求.
DESIGN OF FPGA SPEECH EMOTION RECOGNITION SYSTEM BASED ON NEURAL NETWORK ACCELERATOR
Aiming at the disadvantages of high-power consumption and no portability in the deployment of existing speech emotion recognition system,this paper proposes a design of FPGA speech emotion recognition system based on neural network accelerator.Mel frequency cepstrum coefficient(MFCC)feature extraction of speech was realized on FPGA,which was convenient for recognition.The instruction generation algorithm was designed for the neural network accelerator,and the network model was deployed in the neural network accelerator to realize speech emotion recognition.The main hardware resource consumption of the whole system is 37 078 LUTs and 153 DSPs,which supports the deployment on the mainstream FPGA platform.After testing,the instruction operation error of speech emotion recognition system is less than 0.06,and the output error is less than 0.000 4,which meets the needs of speech emotion recognition.

MFCCSpeech emotion recognitionNeural network acceleratorFPGA

乔栋、陈章进、邓良、张廓

展开 >

上海大学微电子研究与开发中心 上海 200444

上海大学计算中心 上海 200444

MFCC 语音情感识别 神经网络加速器 FPGA

国家自然科学基金项目

61674100

2024

计算机应用与软件
上海市计算技术研究所 上海计算机软件技术开发中心

计算机应用与软件

CSTPCD北大核心
影响因子:0.615
ISSN:1000-386X
年,卷(期):2024.41(10)