Customized Multiple Clustering via Multi-Modal Subspace Proxy Learning
Multiple clustering aims to discover various latent structures of data from different aspects. Deep multiple clustering methods have achieved remarkable performance by exploiting complex patterns and relationships in data. However, existing works struggle to flexibly adapt to diverse user-specific n...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Multiple clustering aims to discover various latent structures of data from
different aspects. Deep multiple clustering methods have achieved remarkable
performance by exploiting complex patterns and relationships in data. However,
existing works struggle to flexibly adapt to diverse user-specific needs in
data grouping, which may require manual understanding of each clustering. To
address these limitations, we introduce Multi-Sub, a novel end-to-end multiple
clustering approach that incorporates a multi-modal subspace proxy learning
framework in this work. Utilizing the synergistic capabilities of CLIP and
GPT-4, Multi-Sub aligns textual prompts expressing user preferences with their
corresponding visual representations. This is achieved by automatically
generating proxy words from large language models that act as subspace bases,
thus allowing for the customized representation of data in terms specific to
the user's interests. Our method consistently outperforms existing baselines
across a broad set of datasets in visual multiple clustering tasks. Our code is
available at https://github.com/Alexander-Yao/Multi-Sub. |
---|---|
DOI: | 10.48550/arxiv.2411.03978 |