首页|基于增强Swin Transformer的深度伪造人脸检测

基于增强Swin Transformer的深度伪造人脸检测

扫码查看
针对传统卷积神经网络感受野的大小受限和特征交互学习能力弱,基于卷积神经网络的伪造人脸检测技术提取到的特征相对单一的问题,提出了基于增强Swin Transformer的深度伪造人脸检测方法,引入了局部多头自注意力和全局多头自注意力机制,结合了Swin Transformer的优势,能够有效地捕获图像上下文信息和视频时序关系,具有较强的全局感受野和长距离依赖建模能力.在DFDC数据集的实验结果表明,该方法优于基线方法,具有较好的深度伪造人脸检测能力.
Deepfake face detection based on enhanced Swin Transformer
Addressing the issues of limited receptive field size and weak feature interaction learning capabilities in traditional convolutional neural networks,resulting in relatively singular feature extraction in conventional convolutional neural network-based deepfake face detection techniques,a deepfake face detection method based on enhanced Swin Transformer is proposed in this pa-per.This method introduces local multi-head self-attention and global multi-head self-attention mechanisms,leveraging the strengths of Swin Transformer to effectively capture image context information and video temporal relationships,with strong global receptive fields and long-distance dependency modeling capabilities.Experimental results on the DFDC dataset demonstrate that our approach outperforms baseline methods,exhibiting superior deepfake face detection capabilities.

enhanced Swin Transformerdeepfake face detectionaudiovisual decompositionconsistency analysisfeature fusion

李杏清、王志兵、杨恺

展开 >

广东创新科技职业学院信息工程学院,东莞 523960

东莞职业技术学院电子信息学院,东莞 523808

东莞职业技术学院建筑学院,东莞 523808

增强Swin Transformer 伪造人脸检测 音视频分解 一致性分析 特征融合

2024

现代计算机
中大控股

现代计算机

影响因子:0.292
ISSN:1007-1423
年,卷(期):2024.30(14)