Reducing the dimension of the feature space for the classification problem using a genetic programming algorithm

Often, a decrease in the efficiency of machine learning algorithms is associated with the problem of classification quality on data with a large dimension of the feature space. Therefore, for data mining methods in solving classification problems, finding the optimal feature space is an important ta...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Melnikova, Olga, Tausnev, Daniil, Lipinskiy, Leonid, Polyakova, Anastasiya
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Often, a decrease in the efficiency of machine learning algorithms is associated with the problem of classification quality on data with a large dimension of the feature space. Therefore, for data mining methods in solving classification problems, finding the optimal feature space is an important task. A method is proposed to dimensionality reduction of the feature space by constructing new features (a new feature space) using a genetic programming algorithm. The dimension of the space to which the source data needs to be compressed can be set at will. After dimensionality reduction, the accuracy of classification on the new feature space has almost minimal differences with respect to the accuracy of classification on the original feature space. In addition, the sample distribution of the original feature space is almost statistically not significantly different from the constructed one. The approbation of the obtained approach is presented on a representative set of test and practical tasks.
ISSN:0094-243X
1551-7616
DOI:10.1063/5.0106039