Exploring granularity-associated invariance features for text-to-image person re-identification: Exploring granularity-associated invariance features for text-to-image re-identification

Text-to-image person re-identification (TIReID) aims to identify and locate pedestrian images based on given textual description queries. The main challenge of the task is bridging the significant gap between text and image modalities. Previous works primarily utilize cross-modality matching constra...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Multimedia systems 2025, Vol.31 (1)
Hauptverfasser: Shao, Chenglong, Si, Tongzhen, Yang, Xiaohui
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Text-to-image person re-identification (TIReID) aims to identify and locate pedestrian images based on given textual description queries. The main challenge of the task is bridging the significant gap between text and image modalities. Previous works primarily utilize cross-modality matching constraints to align the global or local features between samples. However, these methods overlook the relationship inconsistency problem caused by different text descriptions and generate local information redundancy in the local feature extraction process. In this paper, we propose the Granularity-Associated Invariance Features (GAIF) learning strategy to explore potential cross-modality invariant information. Firstly, we propose Global Matching Relationship Improvement (GMRI) with dynamic constraint factors to regulate the matching relationships between different samples. Secondly, we construct the Local Joint Learning Strategy (LJLS) to iteratively optimize fine-grained information from representation learning or metric learning views. Furthermore, we integrate GMRI and LJLS into a unified framework and utilize various constraints to comprehensively optimize global and local associated invariant features. We conduct extensive experiments to assess the proposed GAIF on three TIReID benchmark databases. The experimental results demonstrate that the proposed GAIF outperforms most of the advanced methods in key criteria.
ISSN:0942-4962
1432-1882
DOI:10.1007/s00530-024-01638-9