An evolutionary Pentagon Support Vector finder method
•Detecting outliers is vital in data mining and classification algorithms.•In this sense, we propose an evolutionary Pentagon Support Vector finder method.•We use geometrical calculations and evolutionary clustering to make a more effective system.•Our proposed approach successfully removes outliers...
Gespeichert in:
Veröffentlicht in: | Expert systems with applications 2020-07, Vol.150, p.113284, Article 113284 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | •Detecting outliers is vital in data mining and classification algorithms.•In this sense, we propose an evolutionary Pentagon Support Vector finder method.•We use geometrical calculations and evolutionary clustering to make a more effective system.•Our proposed approach successfully removes outliers from all data sets.•We do not lose vital samples and do not harm final accuracy.
In dealing with big data, we need effective algorithms; effectiveness that depends, among others, on the ability to remove outliers from the data set, especially when dealing with classification problems. To this aim, support vector finder algorithms have been created to save just the most important data in the data pool. Nevertheless, existing classification algorithms, such as Fuzzy C-Means (FCM), suffer from the drawback of setting the initial cluster centers imprecisely. In this paper, we avoid existing shortcomings and aim to find and remove unnecessary data in order to speed up the final classification task without losing vital samples and without harming final accuracy; in this sense, we present a unique approach for finding support vectors, named evolutionary Pentagon Support Vector (PSV) finder method. The originality of the current research lies in using geometrical computations and evolutionary algorithms to make a more effective system, which has the advantage of higher accuracy on some data sets. The proposed method is subsequently tested with seven benchmark data sets and the results are compared to those obtained from performing classification on the original data (classification before and after PSV) under the same conditions. The testing returned promising results. |
---|---|
ISSN: | 0957-4174 1873-6793 |
DOI: | 10.1016/j.eswa.2020.113284 |