Performance enhancement of classifiers through Bio inspired feature selection methods for early detection of lung cancer from microarray genes

Gene expression in the microarray is assimilated with redundant and high-dimensional information. Moreover, the information in the microarray genes mostly correlates with background noise. This paper uses dimensionality reduction and feature selection methods to employ a classification methodology f...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Heliyon 2024-08, Vol.10 (16), p.e36419, Article e36419
Hauptverfasser: M S, Karthika, Rajaguru, Harikumar, Nair, Ajin R.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Gene expression in the microarray is assimilated with redundant and high-dimensional information. Moreover, the information in the microarray genes mostly correlates with background noise. This paper uses dimensionality reduction and feature selection methods to employ a classification methodology for high-dimensional lung cancer microarray data. The approach is enforced in two phases; initially, the genes are dimensionally reduced through Hilbert Transform, Detrend Fluctuation Analysis and Least Square Linear Regression methods. The dimensionally reduced data is further optimized in the next phase using Elephant Herd optimization (EHO) and Cuckoo Search Feature selection methods. The classifiers used here are Bayesian Linear Discriminant, Naive Bayes, Random Forest, Decision Tree, SVM (Linear), SVM (Polynomial), and SVM (RBF). The classifier's performances are analysed with and without feature selection methods. The SVM (Linear) classifier with the DFA Dimensionality Reduction method and EHO feature selection achieved the highest accuracy of 92.26 % compared to other classifiers.
ISSN:2405-8440
2405-8440
DOI:10.1016/j.heliyon.2024.e36419