Dual space latent representation learning for unsupervised feature selection
•This paper proposes a novel feature selection algorithm called DSLRL.•DSLRL exploits the internal association information of data space and feature space to guide feature selection.•DSLRL proposes dual space latent representation learning, which characterizes the inherent structure of data space an...
Gespeichert in:
Veröffentlicht in: | Pattern recognition 2021-06, Vol.114, p.107873, Article 107873 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | •This paper proposes a novel feature selection algorithm called DSLRL.•DSLRL exploits the internal association information of data space and feature space to guide feature selection.•DSLRL proposes dual space latent representation learning, which characterizes the inherent structure of data space and feature space.•DSLRL optimizes the low-dimensional latent representation matrix of data space as a pseudo-label matrix to provide clustering indicators.•DSLRL makes full use of the intrinsic information of dual space to guide the learning of sparse transformation matrix.•Experimental results on real datasets show the effectiveness of DSLRL.
In real-world applications, data instances are not only related to high-dimensional features, but also interconnected with each other. However, the interconnection information has not been fully exploited for feature selection. To address this issue, we propose a novel feature selection algorithm, called dual space latent representation learning for unsupervised feature selection (DSLRL), which exploits the internal association information of data space and feature space to guide feature selection. Firstly, based on latent representation learning in data space, DSLRL produces dual space latent representation learning, which characterizes the inherent structure of data space and feature space, respectively. Secondly, in order to overcome the problem of the lack of label information, DSLRL optimizes the low-dimensional latent representation matrix of data space as a pseudo-label matrix to provide clustering indicators. Moreover, the latent representation matrix of feature space is unified with the transformation matrix to benefit the matching of the data matrix and the clustering indicator matrix. In addition, DSLRL uses non-negative and orthogonal conditions to constrain the sparse transform matrix, making it more accurate for evaluating features. Finally, an alternating method is employed to optimize the objective function. Compared with seven state-of-the-art algorithms, experimental results on twelve datasets show the effectiveness of DSLRL. |
---|---|
ISSN: | 0031-3203 1873-5142 |
DOI: | 10.1016/j.patcog.2021.107873 |