An ensemble of the distance-based and Naive Bayes classifiers for the online classification with data reduction

Bibliographic details
Published in: Journal of intelligent & fuzzy systems 2017-01, Vol.32 (2), p.1289-1296
Main authors: Jędrzejowicz, Joanna; Jędrzejowicz, Piotr
Format: Article
Language: English
Online access: Full text
Description
Summary: The paper proposes two variants of an ensemble of distance-based and Naive Bayes online classifiers with data reduction. In the first variant, the reduced dataset is obtained by applying bias-correction fuzzy clustering. In the second, kernel-based fuzzy clustering serves as the data reduction tool. It is assumed that data vectors with unknown class labels arrive one by one, and that an initial chunk of data with known class labels is available to serve as the initial training set. Classification is carried out in rounds. Each round involves a number of classification decisions equal to the chunk size. For each round, a set of base classifiers is constructed using different distance metrics. This set is extended with the Naive Bayes classifier. The unknown label of each incoming vector is determined through weighted majority voting. After each round has been completed, the training set is replaced by a fresh one and the classification process continues. The approach is validated through a computational experiment involving a number of datasets often used for testing data stream mining algorithms.
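
The round-based scheme in the summary can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several details are assumptions because the summary does not specify them: the base classifiers are 1-nearest-neighbour rules with Euclidean, Manhattan, and Chebyshev distances; the Naive Bayes member is Gaussian; each classifier's voting weight is its accuracy in the previous round; and the true labels of a chunk are assumed to become known once the round ends, so that the chunk can serve as the fresh training set. The fuzzy-clustering data reduction step is omitted entirely. All function names are hypothetical.

```python
import math
from collections import defaultdict

# Distance metrics for the base classifiers (this choice is an assumption).
def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def manhattan(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))

def chebyshev(a, b):
    return max(abs(x - y) for x, y in zip(a, b))

def nearest_neighbour(metric):
    """Return a fit function that builds a 1-NN classifier for `metric`."""
    def fit(X, y):
        def predict(v):
            i = min(range(len(X)), key=lambda j: metric(X[j], v))
            return y[i]
        return predict
    return fit

def gaussian_naive_bayes(X, y):
    """Fit Gaussian Naive Bayes on (X, y) and return a predict function."""
    stats = {}
    for label in set(y):
        rows = [x for x, l in zip(X, y) if l == label]
        cols = list(zip(*rows))
        means = [sum(c) / len(rows) for c in cols]
        # A small constant keeps every variance strictly positive.
        variances = [sum((x - m) ** 2 for x in c) / len(rows) + 1e-9
                     for c, m in zip(cols, means)]
        stats[label] = (math.log(len(rows) / len(X)), means, variances)

    def predict(v):
        def log_posterior(label):
            prior, means, variances = stats[label]
            return prior + sum(
                -0.5 * math.log(2 * math.pi * s) - (x - m) ** 2 / (2 * s)
                for x, m, s in zip(v, means, variances))
        return max(stats, key=log_posterior)
    return predict

FITTERS = [nearest_neighbour(euclidean), nearest_neighbour(manhattan),
           nearest_neighbour(chebyshev), gaussian_naive_bayes]

def classify_stream(X_init, y_init, stream, chunk_size):
    """Classify `stream`, a list of (vector, true_label) pairs, in rounds.

    A true label is read only after its round's predictions are made.
    """
    X_train, y_train = list(X_init), list(y_init)
    weights = [1.0] * len(FITTERS)  # equal weights in the first round
    predictions = []
    for start in range(0, len(stream), chunk_size):
        chunk = stream[start:start + chunk_size]
        models = [fit(X_train, y_train) for fit in FITTERS]
        round_votes = []
        for v, _ in chunk:
            votes = [model(v) for model in models]
            tally = defaultdict(float)
            for label, w in zip(votes, weights):
                tally[label] += w
            # Weighted majority vote over the ensemble members.
            predictions.append(max(tally, key=tally.get))
            round_votes.append(votes)
        truth = [label for _, label in chunk]
        # Assumed weight rule: accuracy in the round just completed.
        weights = [
            sum(votes[k] == t for votes, t in zip(round_votes, truth))
            / len(chunk) + 1e-6
            for k in range(len(FITTERS))]
        # The completed chunk replaces the training set.
        X_train, y_train = [v for v, _ in chunk], truth
    return predictions
```

Calling `classify_stream(X_init, y_init, stream, chunk_size=2)` with a small labelled initial chunk and a list of `(vector, label)` pairs returns one predicted label per incoming vector. Replacing the training set outright after each round, rather than growing it, keeps memory bounded, which is the usual motivation for chunk-based stream classifiers.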
ISSN: 1064-1246, 1875-8967
DOI: 10.3233/JIFS-169127