A LEAN STACKED ENSEMBLE MODEL (LSEM) TO ENHANCE THE EFFECTIVENESS OF CLASSIFYING DATA WITH HUGE IMBALANCE

Bibliographic details
Published in: International journal of advanced research in computer science 2017-11, Vol. 8 (9), p. 87-93
Main author: Elavarasan, N.
Format: Article
Language: English
Online access: Full text
Description
Summary: Knowledge discovery and analysis have become major needs of the current information-rich world. Effective information identification and prediction require effective models, and several machine learning models are available for prediction. This paper concentrates on classification, a supervised machine learning task. An effective classifier enables effective predictions; however, not all input data are clean enough to allow highly accurate classification. Factors such as data imbalance, noise and borderline entries degrade classifier performance. This paper proposes a Lean SVM based Ensemble Model (LSEM) that classifies such data effectively without the need for pre-processing. A heterogeneous ensemble is created using Random Forest and One-Class SVM. Because the SVM requires only part of the training data, the model is lean and trains faster. Experiments are conducted on data with varied imbalance levels, and the proposed LSEM is found to perform better than state-of-the-art models and ensembles, thereby enabling better predictions.
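Illustrative note: the summary names the two ensemble members (Random Forest and One-Class SVM) and says the SVM needs only part of the training data, but it does not say which part or how the outputs are combined. The following is a minimal sketch of that general idea using scikit-learn, not the paper's method. The synthetic data, the choice to fit the One-Class SVM on minority-class rows only, the hyperparameters, and the "either model flags it" combination rule in the hypothetical `ensemble_predict` helper are all assumptions made for illustration.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.svm import OneClassSVM

# Synthetic data with roughly a 95:5 class imbalance (illustrative only).
X, y = make_classification(
    n_samples=2000, n_features=10, n_informative=5,
    weights=[0.95, 0.05], random_state=0,
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

# The Random Forest is fitted on the full training set.
rf = RandomForestClassifier(n_estimators=100, random_state=0)
rf.fit(X_train, y_train)

# The One-Class SVM is fitted on part of the training data only.
# Assumption: that part is the minority-class rows (label 1).
minority_rows = X_train[y_train == 1]
ocsvm = OneClassSVM(kernel="rbf", gamma="scale", nu=0.1)
ocsvm.fit(minority_rows)


def ensemble_predict(X_new):
    """Hypothetical rule: predict the minority class if either model flags it."""
    rf_pred = rf.predict(X_new)  # class labels 0 or 1
    # OneClassSVM.predict returns +1 for inliers and -1 for outliers;
    # an inlier here means "resembles the minority class".
    svm_pred = (ocsvm.predict(X_new) == 1).astype(int)
    return np.maximum(rf_pred, svm_pred)


y_pred = ensemble_predict(X_test)
```

Under these assumptions the One-Class SVM is fitted on a small fraction of the rows the Random Forest sees, which is one plausible reading of "lean". The combination rule can only add minority predictions to those of the Random Forest, so it trades precision for recall; a stacked design with a trained meta-learner would be another option.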
ISSN:0976-5697
DOI:10.26483/ijarcs.v8i9.4913