3D Convolution-Enhanced Recognition of Driver Human-Object Interaction Behavior

Cheng Ming (程鸣)¹, Yan Yunbing (严运兵)²

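Author information

1. School of Automobile and Traffic Engineering, Wuhan University of Science and Technology, Wuhan, Hubei 430072; Technical Center, Dongfeng Motor Group Co., Ltd., Wuhan, Hubei 430058
2. School of Automobile and Traffic Engineering, Wuhan University of Science and Technology, Wuhan, Hubei 430072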

Abstract

Smoking and phone use by drivers are typical human-object interaction (HOI) behaviors. To make the model more robust to occlusion and illumination changes in the driving environment, and to match humans to the objects they interact with more accurately, this study proceeds in three steps. First, a 2D dilated grouped attention mechanism is proposed to optimize the object detection network and improve small-object detection in both the human and object streams. Second, a high-accuracy, lightweight module that fuses a 3D dilated grouped attention mechanism with 3D grouped convolution is proposed and used to build a behavior recognition model for dynamic video, strengthening nonlinear feature extraction in the spatiotemporal domain. Finally, a statistical judgment based on the inter-frame intersection over union (IoU) of the image frames is fused with the prediction of the video behavior recognition model to make the final decision on the driver's human-object interaction behavior. Experimental results demonstrate the effectiveness of the 2D and 3D dilated grouped attention mechanisms for behavior recognition: the average accuracy and recall for driver human-object interaction behavior improve by 12.5% and 7.72%, respectively. The gains are most pronounced when the cigarette or mobile phone is occluded or the lighting is poor, and the method also resolves the confusion between the behavior of the driver and that of rear-seat passengers.
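The final fusion step can be illustrated with a minimal Python sketch. The abstract does not give the IoU threshold, the per-frame statistic, or the fusion rule, so everything below is an assumption made for illustration: the helper names `iou`, `iou_vote`, and `fuse`, the threshold values, and the weighted-average fusion are all hypothetical and are not taken from the paper.

```python
from typing import List, Optional, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2), corner format


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def iou_vote(person_boxes: List[Box],
             object_boxes: List[Optional[Box]],
             iou_thresh: float = 0.05) -> float:
    """Fraction of frames whose person/object IoU exceeds iou_thresh.

    Assumption: a frame with no detected object (None) counts as a miss.
    """
    if not person_boxes:
        return 0.0
    hits = sum(
        1
        for p, o in zip(person_boxes, object_boxes)
        if o is not None and iou(p, o) > iou_thresh
    )
    return hits / len(person_boxes)


def fuse(iou_score: float, video_prob: float,
         weight: float = 0.5, decision_thresh: float = 0.5) -> bool:
    """Hypothetical fusion: weighted average of the IoU statistic and the
    video model's probability, compared against a decision threshold."""
    return weight * iou_score + (1.0 - weight) * video_prob >= decision_thresh


if __name__ == "__main__":
    driver = [(0.0, 0.0, 10.0, 10.0)] * 4
    phone = [(8.0, 8.0, 12.0, 12.0), (8.0, 8.0, 12.0, 12.0), None,
             (20.0, 20.0, 22.0, 22.0)]
    score = iou_vote(driver, phone)
    print(score, fuse(score, video_prob=0.8))
```

In this sketch, a detected object whose box never overlaps the driver's box (for example, a phone held by a rear-seat passenger) contributes nothing to the IoU statistic, which is one plausible way such a cue could help separate driver and passenger behavior.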

Key words

human-object interaction (HOI) / 3D attention mechanism / 3D convolution / smoking / phone calls

Funding

National Natural Science Foundation of China (51975428)

Publication year

2024

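Logistics Engineering and Management (物流工程与管理)

National Commodity Maintenance Science and Technology Information Center Station, China Association of Warehousing (中国仓储协会全国商品养护科技情报中心站)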

Impact factor: 0.412

ISSN: 1674-4993

Number of references: 8
