Detecting deepfake videos based on spatiotemporal attention and convolutional LSTM

扫码查看

原文链接

NSTL
Elsevier

外文摘要：Fake face detection is in dilemma with the rapid development of face manipulation technology. One way to improve the effectiveness of detector is to make full use of intra and inter frame information. In this paper, a novel Xception-LSTM algorithm is proposed by using our new spatiotemporal attention mechanism and convolutional long short-term memory (ConvLSTM). In the algorithm, the spatiotemporal attention mechanism, including spatial and temporal attention mechanism, is proposed to capture and enhance spatiotemporal correlations before dimension reduction of Xception. Thereafter, the ConvLSTM is introduced to consider frame structure information while modeling temporal information. The experimental results on three widely used datasets demonstrate that the proposed algorithms perform better than the state-of-the-art algorithms. In addition, the effectiveness of the spatiotemporal attention mechanism and ConvLSTM are illustrated in ablation experiments. (C) 2022 Elsevier Inc. All rights reserved.

外文关键词：

Face identificationDeepfake detectionAttention mechanismConvolutional LSTM

作者：

Chen, Beijing、Li, Tianmu、Ding, Weiping

展开 >

作者单位：

Nanjing Univ Informat Sci & Technol

Nantong Univ

出版年：

2022

DOI：

10.1016/j.ins.2022.04.014

Information Sciences

EISCI

ISSN：0020-0255

年,卷(期)：2022.601

被引量11
参考文献量39