基于属性依存增强的文搜图行人重识别

Text-to-Image Person Reidentification Based on Attribute Dependency Augmentation

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：文搜图行人重识别旨在通过给定的文本从行人图库中检索目标人物,主要挑战的是学习对自由视角(姿势、照明和相机视点)的图像和自由形式的文本具备鲁棒特征.然而,由于在文本描述和行人图像中存在对行人属性挖掘的不足,在细粒度上因细节的差异从而影响了从文本描述到行人图像的检索性能.因此,研究提出了基于属性依存增强的文搜图行人重识别.首先,从文本描述解析出依存关系,并转化为依存矩阵.其次,设计了一个基于自注意力的属性干预模块来融合文本特征和依存矩阵,得到属性增强的文本特征.此时,文本特征经过干预,更为关注属性信息.最后,文本特征与图像特征参与训练,让整个网络对属性的挖掘更为敏感.在两个数据集CUHK-PEDES和ICFG-PEDES上进行实验,证明了模型的有效性.

外文摘要：Text-to-Image Person Reidentification(TIPR)aims to retrieve a target person from a pedestrian gal-lery with a given text,and its main challenge is to learn the robust features of free-view(posture,lighting and cam-era viewpoint)images and free-form texts.However,due to the lack of pedestrian attribute mining in text descrip-tions and pedestrian images,the retrieval performance from text descriptions to pedestrian images is affected by dif-ferences in details in fine granularity.Therefore,this study proposes TIPR based on Attribute Dependency Aug-mentation(ADA).Firstly,it analyzes dependencies from text descriptions and transforms them into dependency ma-trixes.Then,it designs an attribute intervention module based on self-attention to fuse text features and depen-dency matrixes and obtains attribute-augmented text features which are more concerned about attribute informa-tion after intervention.Finally,it allows text features and image features participate in training,making the whole network more sensitive to attribute mining.Experiments on two datasets CUHK-PEDES and ICFG-PEDES dem-onstrate the effectiveness of the proposed model.

外文关键词：

Text-to-Image Person ReidentificationSelf-attention mechanismSyntactic dependencyFree view

作者：

夏威、袁鑫攀

展开 >

作者单位：

湖南工业大学湖南株洲 412000

关键词：

文搜图行人重识别自注意力机制句法依存自由视角

基金：

湖南省自然科学基金(2022)湖南省教育厅科研项目(2022)

项目编号：

2022JJ3023122B0559

出版年：

2024

DOI：

10.16661/j.cnki.1672-3791.2402-5042-7813

科技资讯

北京国际科技服务中心北京合作创新国际科技服务中心

科技资讯

影响因子：0.51

ISSN：1672-3791

年,卷(期)：2024.22(8)

参考文献量1