A Cross-Modal Person Re-Identification Based on Relationship Mining
Aimed at the problems that while paying attention to the text-based person re-identification models often relying on global and local feature in alignment,very often the inter--modal and intra-modal correlations are in negative,a cross-modal pedestrian re-identification method is proposed based on rela-tionship mining.The method includes a dual-stream network backbone,negative similarity mining mod-ule,and relationship encoder module.Firstly,the global and the local feature are in alignment through the dual-stream network backbone.Secondly the granularity of feature discrimination is enhanced by using the negative similarity mining module,and the similar incorrect results are filtered out.Finally,the relation-ship encoder module is utilized for respectively learning the latent relationship information in both the im-age and text,achieving relationship-level feature alignment.The experimental results on the CUHK-PEDES dataset and the ICFG-PEDES dataset show that this method achieves recognition accuracy higher.
person re-identificationmulti-granularity imagetext alignmentsrelationship feature fusionconvolutional neural networkglobal featurelocal feature