Multi-information pedestrian crossing intention prediction based on mixed attention mechanism

Abstract: Predicting in advance whether pedestrians at the roadside intend to cross the street, or whether they will begin crossing after a period of time, is one of the important challenges facing autonomous vehicles. Effectively fusing multiple sources of information from different modalities is key to predicting pedestrian crossing intention accurately. To this end, this paper proposes a multi-information fusion prediction model based on a hybrid attention mechanism. The model uses an image feature fusion network based on a cross-attention mechanism to extract complementary information between the original image and the semantic image, and to make the model attend more closely to the image regions relevant to pedestrian crossing behavior. A hierarchical gated recurrent unit (GRU) module incorporating an attention mechanism is also proposed to capture the influence of non-visual information from different modalities on pedestrian crossing intention. Comparative experiments on the PIE and JAAD datasets show that the proposed model outperforms comparable studies, and extensive ablation experiments on the proposed modules demonstrate their effectiveness.

Keywords:

prediction of pedestrian crossing intention; cross attention mechanism; autonomous driving; video analysis; computer vision; multi-information fusion

Authors:

Sang Haifeng (桑海峰), Liu Yulong (刘玉龙), Liu Quankai (刘泉恺)

Affiliation:

School of Information Science and Engineering, Shenyang University of Technology, Shenyang 110870

Publication year:

2024

DOI:

10.13195/j.kzyjc.2023.1406

Control and Decision (控制与决策)

Northeastern University

CSTPCD; Peking University Core Journals

Impact factor: 1.227

ISSN: 1001-0920

Year, volume (issue): 2024, 39(12)