现代计算机2024,Vol.30Issue(14) :26-30,58.DOI:10.3969/j.issn.1007-1423.2024.14.004

基于增强Swin Transformer的深度伪造人脸检测

Deepfake face detection based on enhanced Swin Transformer

李杏清 王志兵 杨恺
现代计算机2024,Vol.30Issue(14) :26-30,58.DOI:10.3969/j.issn.1007-1423.2024.14.004

基于增强Swin Transformer的深度伪造人脸检测

Deepfake face detection based on enhanced Swin Transformer

李杏清 1王志兵 2杨恺3
扫码查看

作者信息

  • 1. 广东创新科技职业学院信息工程学院,东莞 523960
  • 2. 东莞职业技术学院电子信息学院,东莞 523808
  • 3. 东莞职业技术学院建筑学院,东莞 523808
  • 折叠

摘要

针对传统卷积神经网络感受野的大小受限和特征交互学习能力弱,基于卷积神经网络的伪造人脸检测技术提取到的特征相对单一的问题,提出了基于增强Swin Transformer的深度伪造人脸检测方法,引入了局部多头自注意力和全局多头自注意力机制,结合了Swin Transformer的优势,能够有效地捕获图像上下文信息和视频时序关系,具有较强的全局感受野和长距离依赖建模能力.在DFDC数据集的实验结果表明,该方法优于基线方法,具有较好的深度伪造人脸检测能力.

Abstract

Addressing the issues of limited receptive field size and weak feature interaction learning capabilities in traditional convolutional neural networks,resulting in relatively singular feature extraction in conventional convolutional neural network-based deepfake face detection techniques,a deepfake face detection method based on enhanced Swin Transformer is proposed in this pa-per.This method introduces local multi-head self-attention and global multi-head self-attention mechanisms,leveraging the strengths of Swin Transformer to effectively capture image context information and video temporal relationships,with strong global receptive fields and long-distance dependency modeling capabilities.Experimental results on the DFDC dataset demonstrate that our approach outperforms baseline methods,exhibiting superior deepfake face detection capabilities.

关键词

增强Swin/Transformer/伪造人脸检测/音视频分解/一致性分析/特征融合

Key words

enhanced Swin Transformer/deepfake face detection/audiovisual decomposition/consistency analysis/feature fusion

引用本文复制引用

出版年

2024
现代计算机
中大控股

现代计算机

影响因子:0.292
ISSN:1007-1423
段落导航相关论文