Video anomaly detection based on random masked Transformer
With the increasing popularity of various types of surveillance cameras,video anomaly event detection has become increasingly important.In this paper,we introduce a novel method for anomaly detection that utilizes a random masked Transformer.This approach involves masking a portion of the video patches in the video sequence to be able to extract features using a temporal Transformer.Besides,we design a temporal Transformer block and a spatial Transformer block to extract spatial-temporal features.Based on the spatial and temporal Transformers,we define anomalies as those with large differences between the predicted frame and true frame.Furthermore,we propose to use the RGB difference to represent the motion,which is more efficient than the optical flow-based methods.Our experiments on public datasets demonstrate that the proposed method using a random masked Transformer approach can detect anomalies from videos effectively.