基于双重增强网络的跨模态行人重识别
Cross-modality person re-identification based on dual enhancement network
陈梦蝶 1卢健 1张奇1
作者信息
- 1. 西安工程大学电子信息学院,陕西西安 710600
- 折叠
摘要
针对异质样本差异、行人遮挡及背景干扰等造成的跨模态行人重识别(person re-identifica-tion,ReID)精度不高的问题,本文提出了一种基于通道与特征学习的双重增强网络(dual enhanced network,DEN).首先从通道级出发,通过随机交换可见光通道来挖掘可见光与红外通道间的关系,增强模型对多模态样本变化的鲁棒性.其次从特征级出发,在模态共享网络前引入基于归一化的注意力模块(normalization-based attention module,NAM),通过惩罚贡献因子较小的权重来避免噪声对模态不变信息学习造成一定干扰.同时采用特征分离模块(feature separation module,FSM)来分离出身份相关特征与身份无关特征,有效提升了模型对异质样本的识别能力.最后联合难样本三元组和加权正则化损失对网络进行监督训练,从而约束行人特征学习.在RegDB数据集上,DEN的Rank1准确率和mAP分别达到了 94.86%和90.10%的高水准.
Abstract
This paper proposes a dual enhanced network(DEN)based on channel and feature learning to address the problem of poor accuracy in cross-modality person re-identification(ReID)caused by hetero-geneous sample differences,person occlusion,and background interference.At the channel level,visible channels are randomly swapped to explore the relationship between visible and infrared channels,enhan-cing the model's robustness to multimodal sample changes.At the feature level,a normalization-based attention module(NAM)is introduced before module sharing network to avoid noise interference on modality-invariant information learning by punishing weights with smaller contribution factors,and a fea-ture separation module(FSM)is used to separate identity-related features from identity-independent features,improving the model's recognition ability for heterogeneous samples.Finally,the network is trained and supervised using hard sample triples and weighted regularization loss to constrain pedestrian feature learning.On the RegDB dataset,DEN achieves a high level of accuracy,with a Rank1 accuracy of 94.86%and mAP of 90.10%.
关键词
行人重识别(ReID)/跨模态/通道交换增强(CEA)/基于归一化的注意力模块(NAM)/特征分离模块(FSM)Key words
person re-identification(ReID)/cross-modality/channel exchangeable augmentation(CEA)/normalization-based attention module(NAM)/feature separation module(FSM)引用本文复制引用
基金项目
西安市碑林区应用技术研发资助项目(GX2007)
出版年
2024