基于增强Swin Transformer的深度伪造人脸检测

Deepfake face detection based on enhanced Swin Transformer

扫码查看

原文链接

维普
万方数据

中文摘要：针对传统卷积神经网络感受野的大小受限和特征交互学习能力弱,基于卷积神经网络的伪造人脸检测技术提取到的特征相对单一的问题,提出了基于增强Swin Transformer的深度伪造人脸检测方法,引入了局部多头自注意力和全局多头自注意力机制,结合了Swin Transformer的优势,能够有效地捕获图像上下文信息和视频时序关系,具有较强的全局感受野和长距离依赖建模能力.在DFDC数据集的实验结果表明,该方法优于基线方法,具有较好的深度伪造人脸检测能力.

外文摘要：Addressing the issues of limited receptive field size and weak feature interaction learning capabilities in traditional convolutional neural networks,resulting in relatively singular feature extraction in conventional convolutional neural network-based deepfake face detection techniques,a deepfake face detection method based on enhanced Swin Transformer is proposed in this pa-per.This method introduces local multi-head self-attention and global multi-head self-attention mechanisms,leveraging the strengths of Swin Transformer to effectively capture image context information and video temporal relationships,with strong global receptive fields and long-distance dependency modeling capabilities.Experimental results on the DFDC dataset demonstrate that our approach outperforms baseline methods,exhibiting superior deepfake face detection capabilities.

外文关键词：

enhanced Swin Transformerdeepfake face detectionaudiovisual decompositionconsistency analysisfeature fusion

作者：

李杏清、王志兵、杨恺

展开 >

作者单位：

广东创新科技职业学院信息工程学院,东莞 523960

东莞职业技术学院电子信息学院,东莞 523808

东莞职业技术学院建筑学院,东莞 523808

关键词：

增强Swin Transformer 伪造人脸检测音视频分解一致性分析特征融合

出版年：

2024

DOI：

10.3969/j.issn.1007-1423.2024.14.004

现代计算机

中大控股

现代计算机

影响因子：0.292

ISSN：1007-1423

年,卷(期)：2024.30(14)