PLOT: Text-based Person Search with Part Slot Attention for Corresponding Part Discovery
Main authors: | , , , |
---|---|
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Order full text |
Summary: | Text-based person search, employing free-form text queries to identify
individuals within a vast image collection, presents a unique challenge in
aligning visual and textual representations, particularly at the human part
level. Existing methods often struggle with part feature extraction and
alignment due to the lack of direct part-level supervision and reliance on
heuristic features. We propose a novel framework that leverages a part
discovery module based on slot attention to autonomously identify and align
distinctive parts across modalities, enhancing interpretability and retrieval
accuracy without explicit part-level correspondence supervision. Additionally,
text-based dynamic part attention adjusts the importance of each part, further
improving retrieval outcomes. Our method is evaluated on three public
benchmarks, significantly outperforming existing methods. |
---|---|
DOI: | 10.48550/arxiv.2409.13475 |