A comparative analysis of data normalization on data mining classification performance
Data are a collection of information in the form of facts. Information is stored in data from various origins. Data processing is an important step that is currently carried out. Data processing is commonly performed using data mining. However, data processing usually faces barriers that keep it from...
Main authors: | Utomo, Dito Putro ; Mesran, M. ; Sarwandi, S. ; Aripin, Soeb ; Syahrizal, Muhammad ; Pristiwanto, P. |
---|---|
Format: | Conference proceeding |
Language: | eng |
Subjects: | |
Online access: | Full text |
container_end_page | |
---|---|
container_issue | 1 |
container_start_page | |
container_title | |
container_volume | 3048 |
creator | Utomo, Dito Putro ; Mesran, M. ; Sarwandi, S. ; Aripin, Soeb ; Syahrizal, Muhammad ; Pristiwanto, P. |
description | Data are a collection of information in the form of facts. Information is stored in data from various origins. Data processing is an important step that is currently carried out. Data processing is commonly performed using data mining. However, data processing usually faces barriers that keep it from running well, because the data stored in a dataset are sometimes not in a normal form. One of the problems encountered in random data is that there is a considerable distance between data values, which impedes data processing. This problem can be solved using normalization. Normalization is also generally referred to as simplification. Algorithms such as min-max normalization and Z-score normalization can be used for this purpose. Testing of the min-max and Z-score normalization algorithms revealed that the former performed better than the latter. This was judged from the magnitude of the increase in accuracy obtained with each algorithm: min-max normalization yielded an increase of 0.41%, while Z-score normalization yielded an increase of 0.14%. |
doi_str_mv | 10.1063/5.0208001 |
format | Conference Proceeding |
fullrecord | <record><control><sourceid>proquest_scita</sourceid><recordid>TN_cdi_proquest_journals_3032785767</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3032785767</sourcerecordid><originalsourceid>FETCH-LOGICAL-p1681-81b2b335e040de7844e2d4bb776f8ae15668912be4ec3e0d2f938295c849ee6e3</originalsourceid><addsrcrecordid>eNotkElLA0EQhRtRMEYP_oMGb8LE6r3nGIIbBLyoeBt6emqkw2x2T4T4650sUFBQ7-Px6hFyy2DBQIsHtQAOFoCdkRlTimVGM31OZgC5zLgUX5fkKqUNAM-NsTPyuaS-bwcX3Rh-kbrONbsUEu1rWrnR0a6PrWvC3yT3HZ3mcG1DF7pv6huXUqiDP6oDxnqPdx6vyUXtmoQ3pz0nH0-P76uXbP32_LparrOBacsyy0peCqEQJFRorJTIK1mWxujaOmRKa5szXqJELxAqXufC8lx5K3NEjWJO7o6-Q-x_tpjGYtNv4_REKgQIbqwy2kzU_ZFKPoyHrMUQQ-virmBQ7HsrVHHqTfwDODFfqQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype><pqid>3032785767</pqid></control><display><type>conference_proceeding</type><title>A comparative analysis of data normalization on data mining classification performance</title><source>American Institute of Physics (AIP) Journals</source><creator>Utomo, Dito Putro ; Mesran, M. ; Sarwandi, S. ; Aripin, Soeb ; Syahrizal, Muhammad ; Pristiwanto, P.</creator><contributor>Kurniawati, Heny ; Radiansyah, Ing Rajih ; Rahim, Robbi</contributor><creatorcontrib>Utomo, Dito Putro ; Mesran, M. ; Sarwandi, S. ; Aripin, Soeb ; Syahrizal, Muhammad ; Pristiwanto, P. ; Kurniawati, Heny ; Radiansyah, Ing Rajih ; Rahim, Robbi</creatorcontrib><description>Data are a collection of information in the form of facts. Information is stored in data from various origins. Data processing is an important step that is currently carried out. Data processing is commonly performed using data mining. However, data processing usually face barriers that keep it from fully running well because the data stored in the dataset sometimes are not in a normal form. 
One of the problems encountered in random data is that there is a considerable distance between data, which sets an impediment to data processing. This problem can be solved using normalization. Normalization is also generally referred to as simplification. Some algorithms such as the min-max normalization and Z-score algorithms can be used for normalization. The results of the testing on the use of the min-max normalization and Z-score algorithms for normalization revealed that the former had better performance than the latter. This was judged from the magnitude of the increase in accuracy obtained from the use of both algorithms, in which case min-max normalization gained an increase of 0.41%, while Z-score normalization did an increase of 0.14%.</description><identifier>ISSN: 0094-243X</identifier><identifier>EISSN: 1551-7616</identifier><identifier>DOI: 10.1063/5.0208001</identifier><identifier>CODEN: APCPCS</identifier><language>eng</language><publisher>Melville: American Institute of Physics</publisher><subject>Algorithms ; Canonical forms ; Data mining ; Data processing ; Standard scores</subject><ispartof>AIP Conference Proceedings, 2024, Vol.3048 (1)</ispartof><rights>Author(s)</rights><rights>2024 Author(s). 
Published under an exclusive license by AIP Publishing.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://pubs.aip.org/acp/article-lookup/doi/10.1063/5.0208001$$EHTML$$P50$$Gscitation$$H</linktohtml><link.rule.ids>309,310,314,780,784,789,790,794,4512,23930,23931,25140,27924,27925,76384</link.rule.ids></links><search><contributor>Kurniawati, Heny</contributor><contributor>Radiansyah, Ing Rajih</contributor><contributor>Rahim, Robbi</contributor><creatorcontrib>Utomo, Dito Putro</creatorcontrib><creatorcontrib>Mesran, M.</creatorcontrib><creatorcontrib>Sarwandi, S.</creatorcontrib><creatorcontrib>Aripin, Soeb</creatorcontrib><creatorcontrib>Syahrizal, Muhammad</creatorcontrib><creatorcontrib>Pristiwanto, P.</creatorcontrib><title>A comparative analysis of data normalization on data mining classification performance</title><title>AIP Conference Proceedings</title><description>Data are a collection of information in the form of facts. Information is stored in data from various origins. Data processing is an important step that is currently carried out. Data processing is commonly performed using data mining. However, data processing usually face barriers that keep it from fully running well because the data stored in the dataset sometimes are not in a normal form. One of the problems encountered in random data is that there is a considerable distance between data, which sets an impediment to data processing. This problem can be solved using normalization. Normalization is also generally referred to as simplification. Some algorithms such as the min-max normalization and Z-score algorithms can be used for normalization. 
The results of the testing on the use of the min-max normalization and Z-score algorithms for normalization revealed that the former had better performance than the latter. This was judged from the magnitude of the increase in accuracy obtained from the use of both algorithms, in which case min-max normalization gained an increase of 0.41%, while Z-score normalization did an increase of 0.14%.</description><subject>Algorithms</subject><subject>Canonical forms</subject><subject>Data mining</subject><subject>Data processing</subject><subject>Standard scores</subject><issn>0094-243X</issn><issn>1551-7616</issn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2024</creationdate><recordtype>conference_proceeding</recordtype><recordid>eNotkElLA0EQhRtRMEYP_oMGb8LE6r3nGIIbBLyoeBt6emqkw2x2T4T4650sUFBQ7-Px6hFyy2DBQIsHtQAOFoCdkRlTimVGM31OZgC5zLgUX5fkKqUNAM-NsTPyuaS-bwcX3Rh-kbrONbsUEu1rWrnR0a6PrWvC3yT3HZ3mcG1DF7pv6huXUqiDP6oDxnqPdx6vyUXtmoQ3pz0nH0-P76uXbP32_LparrOBacsyy0peCqEQJFRorJTIK1mWxujaOmRKa5szXqJELxAqXufC8lx5K3NEjWJO7o6-Q-x_tpjGYtNv4_REKgQIbqwy2kzU_ZFKPoyHrMUQQ-virmBQ7HsrVHHqTfwDODFfqQ</recordid><startdate>20240404</startdate><enddate>20240404</enddate><creator>Utomo, Dito Putro</creator><creator>Mesran, M.</creator><creator>Sarwandi, S.</creator><creator>Aripin, Soeb</creator><creator>Syahrizal, Muhammad</creator><creator>Pristiwanto, P.</creator><general>American Institute of Physics</general><scope>8FD</scope><scope>H8D</scope><scope>L7M</scope></search><sort><creationdate>20240404</creationdate><title>A comparative analysis of data normalization on data mining classification performance</title><author>Utomo, Dito Putro ; Mesran, M. ; Sarwandi, S. 
; Aripin, Soeb ; Syahrizal, Muhammad ; Pristiwanto, P.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p1681-81b2b335e040de7844e2d4bb776f8ae15668912be4ec3e0d2f938295c849ee6e3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Algorithms</topic><topic>Canonical forms</topic><topic>Data mining</topic><topic>Data processing</topic><topic>Standard scores</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Utomo, Dito Putro</creatorcontrib><creatorcontrib>Mesran, M.</creatorcontrib><creatorcontrib>Sarwandi, S.</creatorcontrib><creatorcontrib>Aripin, Soeb</creatorcontrib><creatorcontrib>Syahrizal, Muhammad</creatorcontrib><creatorcontrib>Pristiwanto, P.</creatorcontrib><collection>Technology Research Database</collection><collection>Aerospace Database</collection><collection>Advanced Technologies Database with Aerospace</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Utomo, Dito Putro</au><au>Mesran, M.</au><au>Sarwandi, S.</au><au>Aripin, Soeb</au><au>Syahrizal, Muhammad</au><au>Pristiwanto, P.</au><au>Kurniawati, Heny</au><au>Radiansyah, Ing Rajih</au><au>Rahim, Robbi</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>A comparative analysis of data normalization on data mining classification performance</atitle><btitle>AIP Conference Proceedings</btitle><date>2024-04-04</date><risdate>2024</risdate><volume>3048</volume><issue>1</issue><issn>0094-243X</issn><eissn>1551-7616</eissn><coden>APCPCS</coden><abstract>Data are a collection of information in the form of facts. Information is stored in data from various origins. Data processing is an important step that is currently carried out. Data processing is commonly performed using data mining. 
However, data processing usually face barriers that keep it from fully running well because the data stored in the dataset sometimes are not in a normal form. One of the problems encountered in random data is that there is a considerable distance between data, which sets an impediment to data processing. This problem can be solved using normalization. Normalization is also generally referred to as simplification. Some algorithms such as the min-max normalization and Z-score algorithms can be used for normalization. The results of the testing on the use of the min-max normalization and Z-score algorithms for normalization revealed that the former had better performance than the latter. This was judged from the magnitude of the increase in accuracy obtained from the use of both algorithms, in which case min-max normalization gained an increase of 0.41%, while Z-score normalization did an increase of 0.14%.</abstract><cop>Melville</cop><pub>American Institute of Physics</pub><doi>10.1063/5.0208001</doi><tpages>6</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0094-243X |
ispartof | AIP Conference Proceedings, 2024, Vol.3048 (1) |
issn | 0094-243X ; 1551-7616 |
language | eng |
recordid | cdi_proquest_journals_3032785767 |
source | American Institute of Physics (AIP) Journals |
subjects | Algorithms ; Canonical forms ; Data mining ; Data processing ; Standard scores |
title | A comparative analysis of data normalization on data mining classification performance |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T11%3A40%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_scita&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=A%20comparative%20analysis%20of%20data%20normalization%20on%20data%20mining%20classification%20performance&rft.btitle=AIP%20Conference%20Proceedings&rft.au=Utomo,%20Dito%20Putro&rft.date=2024-04-04&rft.volume=3048&rft.issue=1&rft.issn=0094-243X&rft.eissn=1551-7616&rft.coden=APCPCS&rft_id=info:doi/10.1063/5.0208001&rft_dat=%3Cproquest_scita%3E3032785767%3C/proquest_scita%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3032785767&rft_id=info:pmid/&rfr_iscdi=true |
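
The description names min-max normalization and Z-score normalization without giving their formulas. As a rough illustration only, the two rescalings in their common textbook form might be sketched as below. The function names, the default target range of 0 to 1, and the use of the population standard deviation are assumptions made for this sketch; the record does not say how the authors implemented either method.

```python
from statistics import mean, pstdev


def min_max_normalize(values, new_min=0.0, new_max=1.0):
    """Linearly rescale values into [new_min, new_max] (hypothetical helper)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical, so there is no range to stretch.
        return [new_min for _ in values]
    scale = (new_max - new_min) / (hi - lo)
    return [new_min + (v - lo) * scale for v in values]


def z_score_normalize(values):
    """Center values on the mean and divide by the standard deviation.

    Assumption: the population standard deviation is used; a sample
    standard deviation would give slightly different scores.
    """
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        # No spread in the data, so every score is zero.
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


if __name__ == "__main__":
    feature = [10, 20, 30, 1000]  # one value lies far from the rest
    print(min_max_normalize(feature))
    print(z_score_normalize(feature))
```

Min-max normalization maps the smallest value to the lower bound and the largest to the upper bound, while Z-score normalization produces values with mean 0 and unit standard deviation. Which of the two helps a given classifier more depends on the data and the algorithm.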