基于神经网络加速器的FPGA语音情感识别系统

扫码查看

原文链接

万方数据
维普

中文摘要：针对现有语音情感识别系统的部署功耗高、不具有便携性的缺点,提出一种基于神经网络加速器的FPGA语音情感识别系统设计.在FPGA上实现语音MFCC(Mel Frequency Cepstrum Coefficient)特征的提取,便于进行识别;为神经网络加速器设计指令生成算法,将网络模型部署在神经网络加速器实现语音情感识别.整个系统主要硬件资源消耗为37 078个LUT和153个DSP,支持在主流FPGA平台上的部署.经过检验,语音情感识别系统的指令运算误差可达0.06以下,输出误差为0.0004以下,满足语音情感识别的需求.

外文标题：DESIGN OF FPGA SPEECH EMOTION RECOGNITION SYSTEM BASED ON NEURAL NETWORK ACCELERATOR

外文摘要：Aiming at the disadvantages of high-power consumption and no portability in the deployment of existing speech emotion recognition system,this paper proposes a design of FPGA speech emotion recognition system based on neural network accelerator.Mel frequency cepstrum coefficient(MFCC)feature extraction of speech was realized on FPGA,which was convenient for recognition.The instruction generation algorithm was designed for the neural network accelerator,and the network model was deployed in the neural network accelerator to realize speech emotion recognition.The main hardware resource consumption of the whole system is 37 078 LUTs and 153 DSPs,which supports the deployment on the mainstream FPGA platform.After testing,the instruction operation error of speech emotion recognition system is less than 0.06,and the output error is less than 0.000 4,which meets the needs of speech emotion recognition.

外文关键词：

MFCCSpeech emotion recognitionNeural network acceleratorFPGA

作者：

乔栋、陈章进、邓良、张廓

展开 >

作者单位：

上海大学微电子研究与开发中心上海 200444

上海大学计算中心上海 200444

关键词：

MFCC 语音情感识别神经网络加速器 FPGA

基金：

国家自然科学基金项目

项目编号：

61674100

出版年：

2024

DOI：

10.3969/j.issn.1000-386x.2024.10.025

计算机应用与软件

上海市计算技术研究所上海计算机软件技术开发中心

计算机应用与软件

CSTPCD北大核心

影响因子：0.615

ISSN：1000-386X

年,卷(期)：2024.41(10)