计算机应用与软件2024,Vol.41Issue(10) :163-169,246.DOI:10.3969/j.issn.1000-386x.2024.10.025

基于神经网络加速器的FPGA语音情感识别系统

DESIGN OF FPGA SPEECH EMOTION RECOGNITION SYSTEM BASED ON NEURAL NETWORK ACCELERATOR

乔栋 陈章进 邓良 张廓
计算机应用与软件2024,Vol.41Issue(10) :163-169,246.DOI:10.3969/j.issn.1000-386x.2024.10.025

基于神经网络加速器的FPGA语音情感识别系统

DESIGN OF FPGA SPEECH EMOTION RECOGNITION SYSTEM BASED ON NEURAL NETWORK ACCELERATOR

乔栋 1陈章进 2邓良 1张廓1
扫码查看

作者信息

  • 1. 上海大学微电子研究与开发中心 上海 200444
  • 2. 上海大学微电子研究与开发中心 上海 200444;上海大学计算中心 上海 200444
  • 折叠

摘要

针对现有语音情感识别系统的部署功耗高、不具有便携性的缺点,提出一种基于神经网络加速器的FPGA语音情感识别系统设计.在FPGA上实现语音MFCC(Mel Frequency Cepstrum Coefficient)特征的提取,便于进行识别;为神经网络加速器设计指令生成算法,将网络模型部署在神经网络加速器实现语音情感识别.整个系统主要硬件资源消耗为37 078个LUT和153个DSP,支持在主流FPGA平台上的部署.经过检验,语音情感识别系统的指令运算误差可达0.06以下,输出误差为0.0004以下,满足语音情感识别的需求.

Abstract

Aiming at the disadvantages of high-power consumption and no portability in the deployment of existing speech emotion recognition system,this paper proposes a design of FPGA speech emotion recognition system based on neural network accelerator.Mel frequency cepstrum coefficient(MFCC)feature extraction of speech was realized on FPGA,which was convenient for recognition.The instruction generation algorithm was designed for the neural network accelerator,and the network model was deployed in the neural network accelerator to realize speech emotion recognition.The main hardware resource consumption of the whole system is 37 078 LUTs and 153 DSPs,which supports the deployment on the mainstream FPGA platform.After testing,the instruction operation error of speech emotion recognition system is less than 0.06,and the output error is less than 0.000 4,which meets the needs of speech emotion recognition.

关键词

MFCC/语音情感识别/神经网络加速器/FPGA

Key words

MFCC/Speech emotion recognition/Neural network accelerator/FPGA

引用本文复制引用

基金项目

国家自然科学基金项目(61674100)

出版年

2024
计算机应用与软件
上海市计算技术研究所 上海计算机软件技术开发中心

计算机应用与软件

CSTPCD北大核心
影响因子:0.615
ISSN:1000-386X
段落导航相关论文