Landmark-based k-factorization multi-view subspace clustering

Multi-view subspace clustering (MSC) has gained significant popularity due to its ability to overcome noise and bias present in single views by fusing information from multiple views. MSC enhances the accuracy and robustness of clustering. However, many existing MSC methods suffer from high computat...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Information sciences 2024-05, Vol.667, p.120480, Article 120480
Hauptverfasser: Fang, Yuan, Yang, Geping, Chen, Xiang, Gong, Zhiguo, Yang, Yiyang, Chen, Can, Hao, Zhifeng
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Multi-view subspace clustering (MSC) has gained significant popularity due to its ability to overcome noise and bias present in single views by fusing information from multiple views. MSC enhances the accuracy and robustness of clustering. However, many existing MSC methods suffer from high computational costs and sub-optimal performance on large-scale datasets, since they often construct a fused graph directly from high-dimensional data and then apply spectral clustering. To address these challenges, we propose a framework called Landmark-based k-Factorization Multi-view Subspace Clustering (LKMSC). Our framework tackles these issues by generating a small number of landmarks p≪n for each view, which form a landmark graph. We represent each entire view as a linear combination of these landmarks, where n is the number of data points. To address inconsistencies that naturally occur in landmark graphs due to multiple views, we utilize Landmark Graphs Alignment. This technique incorporates both feature and structural information to capture the correspondence between landmarks. The aligned graphs are then factorized into consensus k groups, emphasizing structural sparsity. LKMSC efficiently extracts features and reduces dimensionality of the input dataset. It eliminates the need for learning large-scale affinity matrix and feature decomposition. Our approach exhibits linear computational complexity and has demonstrated promising results in numerous experimental evaluations across a range of datasets. The source codes and datasets are available at https://github.com/FY0109/LKMSC.
ISSN:0020-0255
DOI:10.1016/j.ins.2024.120480