A comparative analysis of data normalization on data mining classification performance

Data are a collection of information in the form of facts. Information is stored in data from various origins. Data processing is an important step that is currently carried out. Data processing is commonly performed using data mining. However, data processing usually face barriers that keep it from...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Utomo, Dito Putro, Mesran, M., Sarwandi, S., Aripin, Soeb, Syahrizal, Muhammad, Pristiwanto, P.
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Algorithms Canonical forms Data mining Data processing Standard scores
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Data are a collection of information in the form of facts. Information is stored in data from various origins. Data processing is an important step that is currently carried out. Data processing is commonly performed using data mining. However, data processing usually face barriers that keep it from fully running well because the data stored in the dataset sometimes are not in a normal form. One of the problems encountered in random data is that there is a considerable distance between data, which sets an impediment to data processing. This problem can be solved using normalization. Normalization is also generally referred to as simplification. Some algorithms such as the min-max normalization and Z-score algorithms can be used for normalization. The results of the testing on the use of the min-max normalization and Z-score algorithms for normalization revealed that the former had better performance than the latter. This was judged from the magnitude of the increase in accuracy obtained from the use of both algorithms, in which case min-max normalization gained an increase of 0.41%, while Z-score normalization did an increase of 0.14%.
ISSN:	0094-243X 1551-7616
DOI:	10.1063/5.0208001