泛化迁移深度学习下的跨模态图像行人识别算法

Pedestrian Recognition Algorithm of Cross-Modal Image under Generalized Transfer Deep Learning

蔡现龙 ¹李阳 ¹陈曦¹

扫码查看

作者信息

1. 西安明德理工学院信息工程学院,西安 710124
折叠

摘要

针对由于受光照条件变化、行人身高差异等影响,致使监控视频图像在不同时刻的成像存在较大的跨模态差异问题,为准确识别跨模态图像中的行人,提出基于泛化迁移深度学习的跨模态图像行人识别算法.通过循环生成对抗网络(Cyele GAN:Cycle Generative Adversarial Network)形成跨模态图像,采用单目标图像处理对基准图分割处理,得到人体候选区域,在匹配图中搜索和其匹配的区域,得到人体区域的视差,通过视差提取人体区域的深度和透视特征.将注意力机制和跨模态行人识别相结合,分析两种不同类型图像的差异,将两个子空间映射到同一个特征空间,同时引入泛化迁移深度学习算法对损失函数度量学习,自动筛选跨模态图像的行人特征,最终通过模态融合模块将筛选的特征融合处理完成行人识别.实验结果表明,所提算法可以快速、准确地提取不同模态图像中的行人,识别效果较好.

Abstract

Due to the influence of changes in lighting conditions and pedestrian height differences,there are large cross modal differences in surveillance video images at different times.In order to accurately identify pedestrians in cross modal images,a pedestrian recognition algorithm based on generalized transfer depth learning is proposed.The cross modal image is formed through Cyele GAN(Cycle Generative Adversarial Network),and the reference map is segmented using single object image processing to obtain candidate human body regions.The matching regions are searched in the matching map to obtain the disparity of human body regions,and the depth and perspective features of human body regions are extracted through the disparity.The attention mechanism and cross modal pedestrian recognition are combined to analyze the differences between the two types of images.The two subspaces are mapped to the same feature space.And the generalized migration depth learning algorithm is introduced to learn the loss function measurement,automatically screen the pedestrian features of the cross modal images,and finally complete pedestrian recognition through the modal fusion module to fuse the filtered features.The experimental results show that the proposed algorithm can quickly and accurately extract pedestrians from different modal images,and the recognition effect is good.

关键词

泛化迁移深度学习/跨模态图像/行人识别/特征提取

Key words

generalization transfer deep learning/cross-modal images/pedestrian recognition/feature extraction

引用本文复制引用

基金项目

西安明德理工学院科研基金资助项目(2021XY01L09)

出版年

2024

吉林大学学报(信息科学版)

吉林大学

吉林大学学报(信息科学版)

CSTPCD

影响因子：0.607

ISSN：1671-5896

参考文献量10

段落导航