基于深度学习的语音对话情感识别研究
Research on Emotion Recognition in Speech Dialogue Based on Deep Learning
白玉杰 1丁汨1
作者信息
- 1. 郑州澍青医学高等专科学校,河南郑州 450000
- 折叠
摘要
文章针对语音情感识别领域的复杂性,研究基于深度学习的情感识别框架.首先,用梅尔频谱系数进行特征提取,并引入音频数据增强方法.其次,采用长短时记忆网络(Long Short Term Memory,LSTM)方法进行情感识别.最后,利用瑞尔森情感语音和歌曲视听数据库(Ryerson Audio Visual Database of Emotional Speech and Song,RAVDESS)对该方法进行测试.实验结果表明,该方法能够准确地对语音样本进行分类.
Abstract
The article focuses on the complexity of speech emotion recognition and studies a deep learning based emotion recognition framework.Firstly,feature extraction is performed using Mel spectral coefficients,and audio data augmentation methods are introduced.Secondly,the Long Short Term Memory(LSTM)method is used for emotion recognition.Finally,the method was tested using the Ryerson Audio Visual Database of Emotional Speech and Song(RAVDESS).The experimental results show that this method can accurately classify speech samples.
关键词
深度学习/情感分析/语音对话Key words
deep learning/sentiment analysis/voice dialogue引用本文复制引用
出版年
2024