A Multi-Step Nonlinear Dimension-Reduction Approach with Applications to Big Data

In this paper, a novel dimension-reduction approach is presented to overcome challenges such as nonlinear relationships, heterogeneity, and noisy dimensions. Initially, the p attributes in the data are first organized into random groups. Next, to systematically remove redundant and noisy dimensions...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on knowledge and data engineering 2019-12, Vol.31 (12), p.2249-2261
Hauptverfasser:	Krishnan, R., Samaranayake, V. A., Jagannathan, S.
Format:	Artikel
Sprache:	eng
Schlagworte:	Big Data classification Correlation Covariance Covariance matrices dimension-reduction Distance covariance Eigenvalues Eigenvalues and eigenfunctions Indexes Mapping Noise measurement Organizations Parameter estimation Reduction Transformations
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In this paper, a novel dimension-reduction approach is presented to overcome challenges such as nonlinear relationships, heterogeneity, and noisy dimensions. Initially, the p attributes in the data are first organized into random groups. Next, to systematically remove redundant and noisy dimensions from the data, each group is independently mapped into a low dimensional space via a parametric mapping. The group-wise transformation parameters are estimated using a low-rank approximation of distance covariance. The transformed attributes are reorganized into groups based on the magnitude of their respective eigenvalues. The group-wise organization and reduction process is performed until a user-defined criterion on eigenvalues is satisfied. In addition, novel procedures are introduced to aggregate the transformation parameters when the data is available in batches. Overall performance is demonstrated with extensive simulation analysis on classification by employing 10 data-sets.
ISSN:	1041-4347 1558-2191
DOI:	10.1109/TKDE.2018.2876848