Homogeneous and Heterogeneous Optimization for Unsupervised Cross-Modality Person Re-Identification in Visual Internet of Things

Cross-modality visible-infrared person re-identification (VI-ReID) has attracted widespread concern due to its scalability in 24-h video surveillance of the Visual Internet of Things (VIoT). Driven by enough annotated training data, supervised VI-ReID has achieved superior performance. However, anno...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE internet of things journal 2023-11, p.1-1
Hauptverfasser: Si, Tongzhen, He, Fazhi, Li, Penglei, Ye, Mang
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Cross-modality visible-infrared person re-identification (VI-ReID) has attracted widespread concern due to its scalability in 24-h video surveillance of the Visual Internet of Things (VIoT). Driven by enough annotated training data, supervised VI-ReID has achieved superior performance. However, annotating a large amount of cross-modality data is extremely time-consuming, which limits its employment in real-world scenarios. Existing several works neglect the image-level discrepancy and could not obtain reliable feature-level heterogeneous correlation. In the paper, we propose a novel Homogeneous and Heterogeneous Optimization with Modality Style Adaptation (HHO) mechanism to eliminate intra-modality and inter-modality discrepancies without any label information for unsupervised VI-ReID. Specifically, we present the modality style adaptation strategy to transfer unlabeled cross-modality pedestrian styles, which not only increases the image diversity but also bridges the inter-modality gap. Meanwhile, we employ the clustering algorithm to generate pseudo labels for each modality. The homogeneous feature optimization is developed to extract intra-modality pedestrian features. Furthermore, we propose heterogeneous feature optimization to eliminate the inter-modality discrepancy. To this end, a heterogeneous feature search (HFS) module is designed to mine reliable cross-modality signals for each identity. These reliable heterogeneous features are constrained to generate the compact feature distribution, while different identities are forced to be separated. The homogeneous and heterogeneous optimization are seamlessly integrated to learn cross-modality robust features. Abundant experiments prove the superiority of HHO, which gains superior performance.
ISSN:2327-4662
DOI:10.1109/JIOT.2023.3332077