基于金字塔分割注意力和联合损失的表情识别模型

扫码查看

原文链接

国家科技期刊平台
NETL
NSTL
万方数据

中文摘要：如何提取多尺度特征和建模远程通道间的语义依赖仍是表情识别网络面临的挑战.本文提出一种基于金字塔分割注意力的残差网络(Residual network based on pyramid split attention,PSA-ResNet)模型,该模型将ResNet50残差模块中的3×3卷积替换成金字塔分割注意力,以有效提取多尺度特征,增强跨通道语义信息的相关性.同时,为缩小同类表情之间的差异,扩大不同类表情之间的距离,在训练过程中引入了Softmax loss和Center loss联合损失函数优化模型参数.本文所提出的方法在Fer2013和CK+两个公开的数据集上进行仿真实验,分别取得了74.26%和98.35%的准确率,进一步证实了该方法相比前沿算法具有更好的表情识别效果.

外文标题：An Expression Recognition Model Based on Pyramid Split Attention and Joint Loss

外文摘要：How to extract multi-scale features and model semantic dependencies between remote channels remains a challenge for expression recognition networks.This paper proposes a residual network based on pyramid split attention(PSA-ResNet),which replaces the 3×3 convolution in the ResNet50 residual module with PSA to effectively extract multi-scale features and enhance the correlation of cross channel information.In order to reduce the differences between similar expressions and expand the distance between different types of expressions,a joint loss function optimization parameter of Softmax loss and Center loss is introduced during the training process.The proposed model is simulated on two publicly available datasets,Fer2013 and CK+,and achieves accuracies of 74.26% and 98.35%,respectively,further confirming that this method has better recognition results compared to cutting-edge algorithms.

外文关键词：

expression recognitionpyramid split attention(PSA)multi-scale featureresidual network

作者：

谷瑞、顾家乐、宋翠玲

展开 >

作者单位：

南京大学数字经济与管理学院,南京 210003

苏州工业园区服务外包职业学院,苏州 215123

关键词：

表情识别金字塔分割注意力多尺度特征残差网络

出版年：

2024

DOI：

10.16337/j.1004-9037.2024.06.017

数据采集与处理

中国电子学会中国仪器仪表学会信号处理学会　中国仪器仪表学会中国物理学会微弱信号检测学会　南京航空航天大学

数据采集与处理

CSTPCD北大核心

影响因子：0.679

ISSN：1004-9037

年,卷(期)：2024.39(6)