本文提出一个基于扩张卷积的多尺度融合行人原型和时空特征的深度自编码器网络.为了更好地利用视频中行人的时序特征,在编码器和解码器的潜在空间处添加一个双分支结构,分别是预测时空特征的递归神经网络分支和保存行人正常模式的记忆存储模块.为了增强行人特征提取,忽略背景信息影响,增加模型的泛化能力,在编码器中加入改进的空洞空间金字塔池化(Atrous Spatial Pyramid Pooling,ASPP)模块,并在卷积块中使用混合扩张卷积(Hybrid Di-lated Convolution,HDC)原则,解决行人大小变化的问题,同时在解码器中引入多级残差信道注意力机制,获取更多的上下文信息.模型在数据集USCD Ped2,CUHK Avenue的曲线下面积(Area Under the Curve,AUC)分别达到了0.982,0.928.
Anomalous Behavior Detection Network Based on Dilated Convolution and Fused Temporal Features
In this paper,we propose a multi-scale deep autoencoder network based on dilated convolution,incorporating pedes-trian prototypes and spatio-temporal features.To better exploit the temporal features of pedestrians in videos,a dual-branch structure is added to the potential space of the encoder and decoder,the ST-RNN branch of the recurrent neural network for pre-dicting spatio-temporal features and the memory storage module for preserving the normal patterns of pedestrians.To enhance pe-destrian feature extraction,ignore the effect of background information,and improve the generalization ability of the model,an improved atrous spatial pyramid pooling(ASPP)module is added in the encoder,the hybrid dilated convolution(HDC)prin-ciple is used in the convolution block to solve the pedestrian size variation problem,while a multi-level residual channel atten-tion mechanism is introduced in the decoder to obtain more contextual information.The corresponding area under the ROC curve(AUC)of this model reaches 0.982,0.928 for USCD ped2,CUHK Avenue datasets,respectively.