Learning global and local features using graph neural networks for person re-identification

Person re-identification (re-id) is the task of recognizing an individual across non-overlapping camera views. Some approaches only rely on extracting global appearance features from images and fail to consider people’s local body information (head, foot, body shape), which can be used as complement...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Signal processing. Image communication 2022-09, Vol.107, p.116744, Article 116744
Hauptverfasser: Zhang, Ji, Ainam, Jean-Paul, Song, Wenai, Zhao, Li-hui, Wang, Xin, Li, Hongzhou
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Person re-identification (re-id) is the task of recognizing an individual across non-overlapping camera views. Some approaches only rely on extracting global appearance features from images and fail to consider people’s local body information (head, foot, body shape), which can be used as complementary. Other techniques combine local and global features but rely on external information such as pedestrian attributes or human pose to locate and align the local regions. This strategy increases the learning difficulty and is not efficient or robust to real-world scenarios. In this paper, we propose an end-to-end deep learning framework to overcome these limitations. Our method combines the global and local feature representations of a pedestrian and captures the body structural information by modeling the spatial relation of patches using graph neural networks. We also represent the relationships between probe–gallery pairs using a graph neural network and propose incorporating a scoring function to mine a correspondence for local regions. Experimental results on several datasets validate the effectiveness of the proposed method. •A model that exploits both global and local features.•GNNs that model the relations of patches and represent pairwise relationships.•A scoring function that improves alignment and learns discriminative features.•A model that performs favorably well against state-of-the-art methods.
ISSN:0923-5965
1879-2677
DOI:10.1016/j.image.2022.116744