Facial Attribute Estimation and Expression Recognition Based on Contextual Channel Attention Mechanism

Xu Jie¹, Zhong Yong², Wang Yang³, Zhang Changfu⁴, Yang Guanci⁵

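Author information

1. Key Laboratory of Advanced Manufacturing Technology of the Ministry of Education (Guizhou University), Guiyang 550025
2. Chengdu Institute of Computer Application, Chinese Academy of Sciences, Chengdu 610213
3. State Key Laboratory of Public Big Data (Guizhou University), Guiyang 550025
4. School of Mechanical Engineering, Guizhou University, Guiyang 550025
5. Key Laboratory of Advanced Manufacturing Technology of the Ministry of Education (Guizhou University), Guiyang 550025; State Key Laboratory of Public Big Data (Guizhou University), Guiyang 550025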

Abstract

Facial features carry rich information and are highly valuable for facial attribute and expression analysis, but their diversity and complexity make facial analysis difficult. To address this problem, a Facial Attribute estimation and Expression Recognition (FAER) model based on a contextual channel attention mechanism is proposed from the perspective of fine-grained facial features. First, a local feature encoding backbone network based on ConvNeXt is constructed, exploiting the backbone's effectiveness at encoding local features to adequately represent the differences among local facial features. Second, a Contextual Channel Attention (CC Attention) mechanism is proposed that dynamically and adaptively adjusts the weights on feature channels to represent both the global and local features of deep features, compensating for the backbone's limited ability to encode global features. Finally, different classification strategies are designed: for the Facial Attribute Estimation (FAE) and Facial Expression Recognition (FER) tasks, different combinations of loss functions are used to encourage the model to learn more fine-grained facial features. Experimental results show that the proposed FAER model achieves an average accuracy of 91.87% on the facial attribute dataset CelebA (CelebFaces Attributes), 0.55 percentage points higher than the second-best model SwinFace (Swin transformer for Face); on the facial expression datasets RAF-DB and AffectNet it achieves accuracies of 91.75% and 66.66%, which are 0.84 and 0.43 percentage points higher, respectively, than the second-best model TransFER (Transformers for Facial Expression Recognition).
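The abstract does not specify how CC Attention is built, so the following is only a minimal sketch of the general idea of reweighting feature channels from pooled context, in the style of a squeeze-and-excitation gate. It is not the authors' mechanism. The function name `channel_attention`, the weight matrices `w1` and `w2`, and the toy input are all hypothetical, and in a real model the weights would be learned rather than fixed.

```python
import math

def channel_attention(features, w1, w2):
    """Reweight the channels of a C x H x W feature map given as nested lists.

    w1 (hidden x C) and w2 (C x hidden) are hypothetical gate weights;
    they are fixed here for illustration but would normally be learned.
    """
    # Squeeze: global average pooling yields one context value per channel.
    context = [sum(map(sum, ch)) / (len(ch) * len(ch[0])) for ch in features]
    # Excite: a small two-layer gate (ReLU, then sigmoid) mixes channel context.
    hidden = [max(0.0, sum(w * c for w, c in zip(row, context))) for row in w1]
    gates = [1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(row, hidden))))
             for row in w2]
    # Scale: every activation in a channel is multiplied by that channel's gate.
    scaled = [[[g * v for v in r] for r in ch] for g, ch in zip(gates, features)]
    return scaled, gates

# Toy input: 2 channels of 2 x 2 activations with illustrative weights.
features = [[[1.0, 2.0], [3.0, 4.0]], [[0.0, 0.0], [0.0, 0.0]]]
w1 = [[1.0, 0.0]]      # hidden size 1, reads only channel 0's context
w2 = [[1.0], [-1.0]]   # one gate per channel
scaled, gates = channel_attention(features, w1, w2)
```

In this toy case the active channel receives a gate above 0.5 and the empty channel a gate below 0.5, which illustrates how pooled global context can emphasize or suppress individual channels of a locally encoded feature map.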

Keywords

Facial Attribute Estimation (FAE) / Facial Expression Recognition (FER) / attention mechanism / fine-grained feature / feature difference

Publication year

2025

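Journal of Computer Applications (计算机应用)

Chengdu Institute of Computer Application, Chinese Academy of Sciences

Indexed in: CSCD, Peking University Core Journals

Impact factor: 0.892

ISSN: 1001-9081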
