首页|基于深层动态特征双流网络的高效行为识别算法

基于深层动态特征双流网络的高效行为识别算法

扫码查看
为了更高效地获得视频中的行为信息,提出一种结合时域卷积与双流卷积网络的人体行为识别算法。利用多层时域卷积从视频获取动态信息,得到二维的深层动态特征;构建双流卷积网络并采用深层动态特征代替光流特征作为运动信息流的输入;加权融合双流结果,获得对行为的判定。在公开数据集UCF101、HMDB51与NTU-RGBD-6 0测试,最高准确率为94。2%、70。9%与89。1%(跨对象实验)。当与经典算法ECO(E fficient Convo-lutional Network)和TSM(Temporal Shift Module)准确率相近时,平均并行速度分别提高2。1倍和3。6倍。所研究算法提高了计算效率,更具有实用性。
AN EFFICIENT ACTION RECOGNITION ALGORITHM BASED ON DEEP DYNAMIC FEATURE DUAL-STREAM CNN
In order to obtain the behavior information in video more efficiently,we propose a human action recognition method based on temporal convolutional neural network and dual-stream convolutional neural network.Multi-layer temporal convolution was used to obtain dynamic information from the video and obtain two-dimensional depth dynamic features.A dual-stream CNN was constructed,and depth dynamic features were used as input to the motion information stream instead of optical flow features.The dual-stream classification scores were fused in a weighted average to obtain a determination of the video action category.The algorithm was tested on public data set UCF101,HMDB51 and NTU-RGBD-60,with the highest accuracy of 94.2%,70.9%and 89.1%(cross-object experiments).When the accuracy is similar to the classical algorithms,such as ECO and TSM,the average parallel speed is increased by a factor of 2.1 and 3.6 respectively.The proposed algorithm improves the computational efficiency and is more practical.

Computer visionAction recognitionDual-stream CNN3D convolution

高庆吉、徐达、罗其俊、邢志伟

展开 >

中国民航大学机器人研究所 天津 300300

计算机视觉 行为识别 双流卷积网络 三维卷积

国家自然科学基金项目天津市教委科研计划项目

U15332032019KJ117

2024

计算机应用与软件
上海市计算技术研究所 上海计算机软件技术开发中心

计算机应用与软件

CSTPCD北大核心
影响因子:0.615
ISSN:1000-386X
年,卷(期):2024.41(9)