Human action recognition algorithm based on semantics guided neural networks

In recent years, modeling the three-dimensional coordinates of skeletal joints with deep feedforward neural networks has become a trend. However, low recognition accuracy, large parameter counts, and poor real-time performance remain pressing problems in skeleton-based action recognition. To address them, an improved network model based on the semantics-guided neural network (SGN) is proposed. First, a non-local feature extraction module is introduced into the original network to improve its performance when high-level semantics guide model training and prediction, and to reduce its computational complexity and inference time in natural language processing tasks. Second, an attention mechanism is introduced to learn the channel weights of each graph convolutional network (GCN) layer and to reduce redundant information between channels, further improving the model's computational efficiency and recognition accuracy. In addition, a deformable convolution module dynamically learns the weights of the channels of different GCN layers and aggregates the joint features across channels for the final classification, which improves the utilization of feature information. Finally, human action recognition experiments are conducted on the public NTU RGB+D and NTU RGB+D 120 datasets. The experimental results show that the proposed network is an order of magnitude smaller than most networks and clearly outperforms the original network and several other state-of-the-art algorithms in recognition accuracy.
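The abstract does not say which form the channel attention takes. As a rough illustration only, the sketch below assumes a squeeze-and-excitation style gate applied to a channels-by-joints feature map; the function name `channel_attention`, the weight matrices `w1` and `w2`, and the toy values are hypothetical and are not taken from the paper.

```python
import math


def channel_attention(features, w1, w2):
    """Reweight the channels of a (channels x joints) feature map.

    features: list of C lists, each holding J joint activations.
    w1: R x C weight matrix (hypothetical reduction layer).
    w2: C x R weight matrix (hypothetical expansion layer).
    """
    # Squeeze: average each channel over all joints.
    squeezed = [sum(channel) / len(channel) for channel in features]
    # Excite: bottleneck layer with ReLU, then one sigmoid gate per channel.
    hidden = [max(0.0, sum(w * s for w, s in zip(row, squeezed))) for row in w1]
    gates = [
        1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(row, hidden))))
        for row in w2
    ]
    # Scale: multiply every joint activation by the gate of its channel.
    scaled = [[g * v for v in channel] for g, channel in zip(gates, features)]
    return scaled, gates


# Toy input (assumed values): 2 channels, 3 joints, bottleneck width R = 1.
features = [[1.0, 2.0, 3.0], [0.0, 0.0, 0.0]]
w1 = [[1.0, 1.0]]
w2 = [[1.0], [-1.0]]
scaled, gates = channel_attention(features, w1, w2)
```

Each gate lies strictly between 0 and 1, so channels that receive a small gate contribute less to later layers. This is one common way to suppress redundant channel information; in a trained network `w1` and `w2` would be learned rather than fixed by hand.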

human action recognition; graph convolutional network; semantics guided neural network; non-local feature extraction; attention mechanism; deformable convolution

郭宗洋、刘立东、蒋东华、刘子翔、朱熟康、陈京华

School of Information Engineering, Chang'an University, Xi'an, Shaanxi 710064, China

School of Computer Science, Sun Yat-sen University, Guangzhou, Guangdong 510006, China

National Natural Science Foundation of China (52172379)

2024

Journal of Graphics
China Graphics Society

Indexed in: CSTPCD; Peking University Core Journals
Impact factor: 0.73
ISSN: 2095-302X
Year, volume (issue): 2024, 45(1)