特征增强与残差重塑的多重一致性约束半监督视频动作检测

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：一致性正则化半监督视频动作检测方法对原始数据和增广数据进行特征表示时容易引起两类数据间判别域偏差,导致判别结果无法拟合.针对该问题,文中提出特征增强与残差重塑的多重一致性约束半监督视频动作检测方法.首先,将基础动作特征描述子在时空维进行连续性增强编码,获取视频动作理解中至关重要的上下文信息.然后,在通过残差特征重塑模块获得多尺度残差信息的同时进行特征重塑.为了降低不同数据间的判别偏差,分别从分类特征与动作定位特征角度对原始数据和增广数据施加多重一致性约束,实现模型对增广数据和原始数据判别结果和特征表示的匹配.最后,在JHMDB-21、UCF101-24 数据集上的实验表明,文中方法能有效提高少样本标记条件下视频动作检测准确度,具有较强的竞争力.

外文标题：Multi-consistency Constrained Semi-supervised Video Action Detection Based on Feature Enhancement and Residual Reshaping

外文摘要：The feature representations of both original data and augmented data in the consistency regularized semi-supervised video action detection method tend to induce discriminative domain bias between two types of data,thereby resulting in inadequate fitting of the discriminative results.To address this issue,a multi-consistency constrained semi-supervised video action detection method based on feature enhancement and residual reshaping is proposed in this paper.Firstly,the basic action feature descriptors are continuously enhanced and encoded in the spatiotemporal dimension to obtain crucial contextual information for video action understanding.Subsequently,a residual feature reshaping module is employed to obtain multi-scale residual information while reshaping the features.To reduce the discriminative bias between different types of data,multiple consistency constraints are applied to the original data and the augmented data from the perspectives of classification features and action localization features,achieving a match between discriminative results and feature representation of the augmented data and the original data.Experimental results on JHMDB-21 and UCF101-24 datasets demonstrate the effectiveness of the proposed method in improving video action detection accuracy under the condition of limited labeled samples and strong competitiveness.

外文关键词：

Semi-supervised LearningVideo Action DetectionFeature EnhancementMultiple Con-sistency Constraints

作者：

胡正平、张琦明、王雨露、张和浩、邸继锐

展开 >

作者单位：

燕山大学信息科学与工程学院秦皇岛 066004

燕山大学河北省信息传输与信号处理重点实验室秦皇岛 066004

关键词：

半监督学习视频动作检测特征增强多重一致性约束

基金：

国家自然科学基金项目国家自然科学基金青年科学基金项目

项目编号：

6177142062001413

出版年：

2024

DOI：

10.16451/j.cnki.issn1003-6059.202405002

模式识别与人工智能

中国自动化学会,国家智能计算机研究开发中心,中国科学院合肥智能机械研究所

模式识别与人工智能

CSTPCD北大核心

影响因子：0.954

ISSN：1003-6059

年,卷(期)：2024.37(5)