Sparse Subspace Clustering via Two-Step Reweighted L1-Minimization: Algorithm and Provable Neighbor Recovery Rates

Bibliographic details
Published in:	IEEE Transactions on Information Theory, 2021-02, Vol. 67 (2), p. 1216-1263
Main authors:	Wu, Jwo-Yuh; Huang, Liang-Chi; Yang, Ming-Hsun; Liu, Chun-Hung
Format:	Article
Language:	English
Subjects:	Algorithms; Classification algorithms; Clustering; Clustering algorithms; compressive sensing; discovery rate; Eigenvalues and eigenfunctions; Indexes; Mathematical analysis; Minimization; neighbor recovery; Noise measurement; Optimization; Partitioning algorithms; performance guarantees; Recovery; Regression analysis; sparse representation; sparse subspace clustering; Statistical analysis; Subspace clustering; Upper bounds; weighted LASSO; Weighting

Description
Abstract:	Sparse subspace clustering (SSC) relies on sparse regression for accurate neighbor identification. Inspired by recent progress in compressive sensing, this paper proposes a new sparse regression scheme for SSC via two-step reweighted $\ell_1$-minimization, which also generalizes a two-step $\ell_1$-minimization algorithm introduced by E. J. Candès et al. in [The Annals of Statistics, vol. 42, no. 2, pp. 669-699, 2014] without incurring extra algorithmic complexity. To fully exploit the prior information offered by the sparse representation vector computed in the first step, our approach places a weight on each component of the regression vector and solves a weighted LASSO in the second step. We propose a data weighting rule suitable for enhancing neighbor identification accuracy. Then, under the formulation of the dual problem of the weighted LASSO, we study in depth the theoretical neighbor recovery rates of the proposed scheme. Specifically, we establish a connection between the locations of the nonzeros of the optimal sparse solution to the weighted LASSO and the indexes of the active constraints of the dual problem. Afterwards, under the semi-random model, analytic lower and upper probability bounds for various neighbor recovery events are derived. Our analytic results confirm that, with the aid of data weighting and if the prior neighbor information is accurate enough, the proposed scheme can, with higher probability, produce many correct neighbors and few incorrect neighbors compared to the solution without data weighting. Computer simulations are provided to validate our analytic study and demonstrate the effectiveness of the proposed approach.
ISSN:	0018-9448, 1557-9654
DOI:	10.1109/TIT.2020.3039114
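
Illustrative sketch:	The two-step idea in the abstract (an unweighted sparse regression first, then a weighted LASSO whose weights are derived from the first-step solution) can be sketched roughly as follows. This is not the paper's algorithm: the abstract does not state the proposed data weighting rule, so the sketch substitutes the classical reweighted-$\ell_1$ rule, with weights proportional to 1/(|c_i| + eps), as a stand-in. The solver (plain ISTA), the function names `weighted_lasso_ista` and `two_step_reweighted`, and the parameters `lam`, `eps`, and `n_iter` are all assumptions made for illustration.

```python
import numpy as np


def weighted_lasso_ista(X, y, weights, lam=0.1, n_iter=500):
    """Approximately minimize 0.5*||y - X c||^2 + lam * sum_i weights[i]*|c_i|.

    Uses ISTA (proximal gradient descent) with a fixed step size.
    In SSC, the columns of X would be the other data points and y the
    point whose neighbors are sought (hypothetical setup for this sketch).
    """
    # Step size 1/L, where L = ||X||_2^2 is the Lipschitz constant of the gradient.
    step = 1.0 / (np.linalg.norm(X, 2) ** 2)
    c = np.zeros(X.shape[1])
    for _ in range(n_iter):
        grad = X.T @ (X @ c - y)           # gradient of the least-squares term
        z = c - step * grad                # gradient step
        thresh = step * lam * weights      # per-coordinate soft threshold
        c = np.sign(z) * np.maximum(np.abs(z) - thresh, 0.0)
    return c


def two_step_reweighted(X, y, lam=0.1, eps=1e-3):
    """Two-step scheme: plain LASSO first, then a weighted LASSO.

    The weighting rule below is an assumed stand-in (classical
    reweighted-l1), NOT the rule proposed in the paper.
    """
    n = X.shape[1]
    # Step 1: unweighted LASSO (all weights equal to one).
    c1 = weighted_lasso_ista(X, y, np.ones(n), lam)
    # Weights: large first-step coefficients get a small penalty in step 2.
    w = 1.0 / (np.abs(c1) + eps)
    w = w / w.max()                        # normalize so the largest weight is 1
    # Step 2: weighted LASSO using the first-step information.
    c2 = weighted_lasso_ista(X, y, w, lam)
    return c1, c2, w
```

With this normalization, coordinates that were zero after the first step keep the full penalty `lam`, while coordinates that were large are penalized less, which loosely mirrors the abstract's idea of using first-step neighbor information as a prior. The nonzero entries of the second-step vector would then be read as the candidate neighbors of `y`.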