A comparative analysis of data normalization on data mining classification performance

Data are a collection of information in the form of facts. Information is stored in data from various origins. Data processing is an important step that is currently carried out. Data processing is commonly performed using data mining. However, data processing usually face barriers that keep it from...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Utomo, Dito Putro, Mesran, M., Sarwandi, S., Aripin, Soeb, Syahrizal, Muhammad, Pristiwanto, P.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 1
container_start_page
container_title
container_volume 3048
creator Utomo, Dito Putro
Mesran, M.
Sarwandi, S.
Aripin, Soeb
Syahrizal, Muhammad
Pristiwanto, P.
description Data are a collection of information in the form of facts. Information is stored in data from various origins. Data processing is an important step that is currently carried out. Data processing is commonly performed using data mining. However, data processing usually face barriers that keep it from fully running well because the data stored in the dataset sometimes are not in a normal form. One of the problems encountered in random data is that there is a considerable distance between data, which sets an impediment to data processing. This problem can be solved using normalization. Normalization is also generally referred to as simplification. Some algorithms such as the min-max normalization and Z-score algorithms can be used for normalization. The results of the testing on the use of the min-max normalization and Z-score algorithms for normalization revealed that the former had better performance than the latter. This was judged from the magnitude of the increase in accuracy obtained from the use of both algorithms, in which case min-max normalization gained an increase of 0.41%, while Z-score normalization did an increase of 0.14%.
doi_str_mv 10.1063/5.0208001
format Conference Proceeding
fullrecord <record><control><sourceid>proquest_scita</sourceid><recordid>TN_cdi_proquest_journals_3032785767</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3032785767</sourcerecordid><originalsourceid>FETCH-LOGICAL-p1681-81b2b335e040de7844e2d4bb776f8ae15668912be4ec3e0d2f938295c849ee6e3</originalsourceid><addsrcrecordid>eNotkElLA0EQhRtRMEYP_oMGb8LE6r3nGIIbBLyoeBt6emqkw2x2T4T4650sUFBQ7-Px6hFyy2DBQIsHtQAOFoCdkRlTimVGM31OZgC5zLgUX5fkKqUNAM-NsTPyuaS-bwcX3Rh-kbrONbsUEu1rWrnR0a6PrWvC3yT3HZ3mcG1DF7pv6huXUqiDP6oDxnqPdx6vyUXtmoQ3pz0nH0-P76uXbP32_LparrOBacsyy0peCqEQJFRorJTIK1mWxujaOmRKa5szXqJELxAqXufC8lx5K3NEjWJO7o6-Q-x_tpjGYtNv4_REKgQIbqwy2kzU_ZFKPoyHrMUQQ-virmBQ7HsrVHHqTfwDODFfqQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype><pqid>3032785767</pqid></control><display><type>conference_proceeding</type><title>A comparative analysis of data normalization on data mining classification performance</title><source>American Institute of Physics (AIP) Journals</source><creator>Utomo, Dito Putro ; Mesran, M. ; Sarwandi, S. ; Aripin, Soeb ; Syahrizal, Muhammad ; Pristiwanto, P.</creator><contributor>Kurniawati, Heny ; Radiansyah, Ing Rajih ; Rahim, Robbi</contributor><creatorcontrib>Utomo, Dito Putro ; Mesran, M. ; Sarwandi, S. ; Aripin, Soeb ; Syahrizal, Muhammad ; Pristiwanto, P. ; Kurniawati, Heny ; Radiansyah, Ing Rajih ; Rahim, Robbi</creatorcontrib><description>Data are a collection of information in the form of facts. Information is stored in data from various origins. Data processing is an important step that is currently carried out. Data processing is commonly performed using data mining. However, data processing usually face barriers that keep it from fully running well because the data stored in the dataset sometimes are not in a normal form. One of the problems encountered in random data is that there is a considerable distance between data, which sets an impediment to data processing. This problem can be solved using normalization. Normalization is also generally referred to as simplification. Some algorithms such as the min-max normalization and Z-score algorithms can be used for normalization. The results of the testing on the use of the min-max normalization and Z-score algorithms for normalization revealed that the former had better performance than the latter. This was judged from the magnitude of the increase in accuracy obtained from the use of both algorithms, in which case min-max normalization gained an increase of 0.41%, while Z-score normalization did an increase of 0.14%.</description><identifier>ISSN: 0094-243X</identifier><identifier>EISSN: 1551-7616</identifier><identifier>DOI: 10.1063/5.0208001</identifier><identifier>CODEN: APCPCS</identifier><language>eng</language><publisher>Melville: American Institute of Physics</publisher><subject>Algorithms ; Canonical forms ; Data mining ; Data processing ; Standard scores</subject><ispartof>AIP Conference Proceedings, 2024, Vol.3048 (1)</ispartof><rights>Author(s)</rights><rights>2024 Author(s). Published under an exclusive license by AIP Publishing.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://pubs.aip.org/acp/article-lookup/doi/10.1063/5.0208001$$EHTML$$P50$$Gscitation$$H</linktohtml><link.rule.ids>309,310,314,780,784,789,790,794,4512,23930,23931,25140,27924,27925,76384</link.rule.ids></links><search><contributor>Kurniawati, Heny</contributor><contributor>Radiansyah, Ing Rajih</contributor><contributor>Rahim, Robbi</contributor><creatorcontrib>Utomo, Dito Putro</creatorcontrib><creatorcontrib>Mesran, M.</creatorcontrib><creatorcontrib>Sarwandi, S.</creatorcontrib><creatorcontrib>Aripin, Soeb</creatorcontrib><creatorcontrib>Syahrizal, Muhammad</creatorcontrib><creatorcontrib>Pristiwanto, P.</creatorcontrib><title>A comparative analysis of data normalization on data mining classification performance</title><title>AIP Conference Proceedings</title><description>Data are a collection of information in the form of facts. Information is stored in data from various origins. Data processing is an important step that is currently carried out. Data processing is commonly performed using data mining. However, data processing usually face barriers that keep it from fully running well because the data stored in the dataset sometimes are not in a normal form. One of the problems encountered in random data is that there is a considerable distance between data, which sets an impediment to data processing. This problem can be solved using normalization. Normalization is also generally referred to as simplification. Some algorithms such as the min-max normalization and Z-score algorithms can be used for normalization. The results of the testing on the use of the min-max normalization and Z-score algorithms for normalization revealed that the former had better performance than the latter. This was judged from the magnitude of the increase in accuracy obtained from the use of both algorithms, in which case min-max normalization gained an increase of 0.41%, while Z-score normalization did an increase of 0.14%.</description><subject>Algorithms</subject><subject>Canonical forms</subject><subject>Data mining</subject><subject>Data processing</subject><subject>Standard scores</subject><issn>0094-243X</issn><issn>1551-7616</issn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2024</creationdate><recordtype>conference_proceeding</recordtype><recordid>eNotkElLA0EQhRtRMEYP_oMGb8LE6r3nGIIbBLyoeBt6emqkw2x2T4T4650sUFBQ7-Px6hFyy2DBQIsHtQAOFoCdkRlTimVGM31OZgC5zLgUX5fkKqUNAM-NsTPyuaS-bwcX3Rh-kbrONbsUEu1rWrnR0a6PrWvC3yT3HZ3mcG1DF7pv6huXUqiDP6oDxnqPdx6vyUXtmoQ3pz0nH0-P76uXbP32_LparrOBacsyy0peCqEQJFRorJTIK1mWxujaOmRKa5szXqJELxAqXufC8lx5K3NEjWJO7o6-Q-x_tpjGYtNv4_REKgQIbqwy2kzU_ZFKPoyHrMUQQ-virmBQ7HsrVHHqTfwDODFfqQ</recordid><startdate>20240404</startdate><enddate>20240404</enddate><creator>Utomo, Dito Putro</creator><creator>Mesran, M.</creator><creator>Sarwandi, S.</creator><creator>Aripin, Soeb</creator><creator>Syahrizal, Muhammad</creator><creator>Pristiwanto, P.</creator><general>American Institute of Physics</general><scope>8FD</scope><scope>H8D</scope><scope>L7M</scope></search><sort><creationdate>20240404</creationdate><title>A comparative analysis of data normalization on data mining classification performance</title><author>Utomo, Dito Putro ; Mesran, M. ; Sarwandi, S. ; Aripin, Soeb ; Syahrizal, Muhammad ; Pristiwanto, P.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p1681-81b2b335e040de7844e2d4bb776f8ae15668912be4ec3e0d2f938295c849ee6e3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Algorithms</topic><topic>Canonical forms</topic><topic>Data mining</topic><topic>Data processing</topic><topic>Standard scores</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Utomo, Dito Putro</creatorcontrib><creatorcontrib>Mesran, M.</creatorcontrib><creatorcontrib>Sarwandi, S.</creatorcontrib><creatorcontrib>Aripin, Soeb</creatorcontrib><creatorcontrib>Syahrizal, Muhammad</creatorcontrib><creatorcontrib>Pristiwanto, P.</creatorcontrib><collection>Technology Research Database</collection><collection>Aerospace Database</collection><collection>Advanced Technologies Database with Aerospace</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Utomo, Dito Putro</au><au>Mesran, M.</au><au>Sarwandi, S.</au><au>Aripin, Soeb</au><au>Syahrizal, Muhammad</au><au>Pristiwanto, P.</au><au>Kurniawati, Heny</au><au>Radiansyah, Ing Rajih</au><au>Rahim, Robbi</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>A comparative analysis of data normalization on data mining classification performance</atitle><btitle>AIP Conference Proceedings</btitle><date>2024-04-04</date><risdate>2024</risdate><volume>3048</volume><issue>1</issue><issn>0094-243X</issn><eissn>1551-7616</eissn><coden>APCPCS</coden><abstract>Data are a collection of information in the form of facts. Information is stored in data from various origins. Data processing is an important step that is currently carried out. Data processing is commonly performed using data mining. However, data processing usually face barriers that keep it from fully running well because the data stored in the dataset sometimes are not in a normal form. One of the problems encountered in random data is that there is a considerable distance between data, which sets an impediment to data processing. This problem can be solved using normalization. Normalization is also generally referred to as simplification. Some algorithms such as the min-max normalization and Z-score algorithms can be used for normalization. The results of the testing on the use of the min-max normalization and Z-score algorithms for normalization revealed that the former had better performance than the latter. This was judged from the magnitude of the increase in accuracy obtained from the use of both algorithms, in which case min-max normalization gained an increase of 0.41%, while Z-score normalization did an increase of 0.14%.</abstract><cop>Melville</cop><pub>American Institute of Physics</pub><doi>10.1063/5.0208001</doi><tpages>6</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0094-243X
ispartof AIP Conference Proceedings, 2024, Vol.3048 (1)
issn 0094-243X
1551-7616
language eng
recordid cdi_proquest_journals_3032785767
source American Institute of Physics (AIP) Journals
subjects Algorithms
Canonical forms
Data mining
Data processing
Standard scores
title A comparative analysis of data normalization on data mining classification performance
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T11%3A40%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_scita&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=A%20comparative%20analysis%20of%20data%20normalization%20on%20data%20mining%20classification%20performance&rft.btitle=AIP%20Conference%20Proceedings&rft.au=Utomo,%20Dito%20Putro&rft.date=2024-04-04&rft.volume=3048&rft.issue=1&rft.issn=0094-243X&rft.eissn=1551-7616&rft.coden=APCPCS&rft_id=info:doi/10.1063/5.0208001&rft_dat=%3Cproquest_scita%3E3032785767%3C/proquest_scita%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3032785767&rft_id=info:pmid/&rfr_iscdi=true