Speech Emotion Recognition with Complementary Feature Learning Framework and Attentional Feature Fusion Module
Deep learning feature extraction methods often fail to comprehensively extract and effectively integrate emotional features from speech. To address this limitation, this paper proposes a novel speech emotion recognition model that integrates a complementary feature learning framework with an attention feature fusion module. The complementary feature learning framework consists of two independent representation extraction branches and an interactive complementary representation extraction branch, thoroughly covering both the independent and the complementary representations of emotional features. To further optimize model performance, an attention feature fusion module is introduced. This module assigns weights according to how much each representation contributes to emotion classification, enabling the model to focus on the features most beneficial for emotion recognition. Experiments conducted on two public emotion databases (Emo-DB and IEMOCAP) validate the robustness and effectiveness of the proposed model.
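The abstract describes the architecture only at a high level. The sketch below is a minimal PyTorch-style illustration, under assumed layer choices (GRU branches, multi-head attention for the interactive complementary branch, and a learned softmax weighting for the fusion module), of how the two independent branches, the interactive branch, and the attention feature fusion might be wired together; all class names, layers, and dimensions are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn

class AttentionFeatureFusion(nn.Module):
    """Weights each branch's representation by a learned attention score,
    so representations that contribute more to emotion classification
    receive larger weights in the fused feature."""
    def __init__(self, dim: int):
        super().__init__()
        self.score = nn.Linear(dim, 1)  # one scalar score per representation

    def forward(self, reps: list[torch.Tensor]) -> torch.Tensor:
        # reps: list of (batch, dim) representations, one per branch
        stacked = torch.stack(reps, dim=1)                   # (batch, branches, dim)
        weights = torch.softmax(self.score(stacked), dim=1)  # (batch, branches, 1)
        return (weights * stacked).sum(dim=1)                # (batch, dim)

class ComplementarySERModel(nn.Module):
    """Two independent extraction branches plus an interactive complementary
    branch, fused by attention and passed to an emotion classifier."""
    def __init__(self, in_dim: int, hid: int, num_classes: int = 4):
        super().__init__()
        self.branch_a = nn.GRU(in_dim, hid, batch_first=True)
        self.branch_b = nn.GRU(in_dim, hid, batch_first=True)
        # interactive branch: one branch's features attend over the other's
        self.interact = nn.MultiheadAttention(hid, num_heads=4, batch_first=True)
        self.fusion = AttentionFeatureFusion(hid)
        self.classifier = nn.Linear(hid, num_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, frames, in_dim) frame-level acoustic features
        a, _ = self.branch_a(x)
        b, _ = self.branch_b(x)
        c, _ = self.interact(a, b, b)           # complementary representation
        reps = [a[:, -1], b[:, -1], c[:, -1]]   # last-frame summaries per branch
        return self.classifier(self.fusion(reps))
```

For example, `ComplementarySERModel(in_dim=40, hid=128)` applied to a batch of 40-dimensional log-mel frame sequences would yield per-utterance emotion logits; the actual branch architectures and fusion details are given in the paper itself.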