A Weighted Similarity Measure for k-Nearest Neighbors Algorithm

One of the most important problems in machine learning, which has gained importance in recent years, is classification. The k-nearest neighbors (kNN) algorithm is widely used in classification problem because it is a simple and effective method. However, there are several factors affecting the perfo...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Celal Bayar Üniversitesi Fen Bilimleri Dergisi 2019-12, Vol.15 (4), p.393-400
Hauptverfasser: KARABULUT, Bergen, ARSLAN, Güvenç, ÜNVER, Halil Murat
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 400
container_issue 4
container_start_page 393
container_title Celal Bayar Üniversitesi Fen Bilimleri Dergisi
container_volume 15
creator KARABULUT, Bergen
ARSLAN, Güvenç
ÜNVER, Halil Murat
description One of the most important problems in machine learning, which has gained importance in recent years, is classification. The k-nearest neighbors (kNN) algorithm is widely used in classification problem because it is a simple and effective method. However, there are several factors affecting the performance of kNN algorithm. One of them is determining an appropriate proximity (distance or similarity) measure. Although the Euclidean distance is often used as a proximity measure in the application of the kNN, studies show that the use of different proximity measures can improve the performance of the kNN. In this study, we propose the Weighted Similarity k-Nearest Neighbors algorithm (WS-kNN) which use a weighted similarity as proximity measure in the kNN algorithm. Firstly, it calculates the weight of each attribute and similarity between the instances in the dataset. And then, it weights similarities by attribute weights and creates a weighted similarity matrix to use as proximity measure. The proposed algorithm is compared with the classical kNN method based on the Euclidean distance. To verify the performance of our algorithm, experiments are made on 10 different real-life datasets from the UCI (UC Irvine Machine Learning Repository) by classification accuracy. Experimental results show that the proposed WS-kNN algorithm can achieve comparative classification accuracy. For some datasets, this new algorithm gives highly good results. In addition, we demonstrated that the use of different proximity measures can affect the classification accuracy of kNN algorithm.
doi_str_mv 10.18466/cbayarfbe.618964
format Article
fullrecord <record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_18466_cbayarfbe_618964</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_18466_cbayarfbe_618964</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1334-c3d8728b62ee1975219e0ba356fd573e82db1c80ad01896eb847e44eed506c9e3</originalsourceid><addsrcrecordid>eNo9j7tOwzAYhT2ARFX6AGx-ARff40woqrhJpQxQlc3y5U9rkSjIDkPenpYilnOWT0fnQ-iG0SUzUuvb4N3kcuthqZmptbxAMyaoIsf4uEKLUpKnQnNplNEzdNfgHaT9YYSI31KfOpfTOOEXcOU7A26HjD_JBlyGMuLNifRDLrjp9sMRPPTX6LJ1XYHFX8_R9uH-ffVE1q-Pz6tmTQITQpIgoqm48ZoDsLpSnNVAvRNKt1FVAgyPngVDXaSn0-CNrEBKgKioDjWIOWLn3ZCHUjK09iun3uXJMmp_xe2_uD2Lix9G21Af</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>A Weighted Similarity Measure for k-Nearest Neighbors Algorithm</title><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>KARABULUT, Bergen ; ARSLAN, Güvenç ; ÜNVER, Halil Murat</creator><creatorcontrib>KARABULUT, Bergen ; ARSLAN, Güvenç ; ÜNVER, Halil Murat</creatorcontrib><description>One of the most important problems in machine learning, which has gained importance in recent years, is classification. The k-nearest neighbors (kNN) algorithm is widely used in classification problem because it is a simple and effective method. However, there are several factors affecting the performance of kNN algorithm. One of them is determining an appropriate proximity (distance or similarity) measure. Although the Euclidean distance is often used as a proximity measure in the application of the kNN, studies show that the use of different proximity measures can improve the performance of the kNN. In this study, we propose the Weighted Similarity k-Nearest Neighbors algorithm (WS-kNN) which use a weighted similarity as proximity measure in the kNN algorithm. Firstly, it calculates the weight of each attribute and similarity between the instances in the dataset. And then, it weights similarities by attribute weights and creates a weighted similarity matrix to use as proximity measure. The proposed algorithm is compared with the classical kNN method based on the Euclidean distance. To verify the performance of our algorithm, experiments are made on 10 different real-life datasets from the UCI (UC Irvine Machine Learning Repository) by classification accuracy. Experimental results show that the proposed WS-kNN algorithm can achieve comparative classification accuracy. For some datasets, this new algorithm gives highly good results. In addition, we demonstrated that the use of different proximity measures can affect the classification accuracy of kNN algorithm.</description><identifier>ISSN: 1305-130X</identifier><identifier>DOI: 10.18466/cbayarfbe.618964</identifier><language>eng</language><ispartof>Celal Bayar Üniversitesi Fen Bilimleri Dergisi, 2019-12, Vol.15 (4), p.393-400</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c1334-c3d8728b62ee1975219e0ba356fd573e82db1c80ad01896eb847e44eed506c9e3</citedby><cites>FETCH-LOGICAL-c1334-c3d8728b62ee1975219e0ba356fd573e82db1c80ad01896eb847e44eed506c9e3</cites><orcidid>0000-0003-0755-1289 ; 0000-0002-4770-2689</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>KARABULUT, Bergen</creatorcontrib><creatorcontrib>ARSLAN, Güvenç</creatorcontrib><creatorcontrib>ÜNVER, Halil Murat</creatorcontrib><title>A Weighted Similarity Measure for k-Nearest Neighbors Algorithm</title><title>Celal Bayar Üniversitesi Fen Bilimleri Dergisi</title><description>One of the most important problems in machine learning, which has gained importance in recent years, is classification. The k-nearest neighbors (kNN) algorithm is widely used in classification problem because it is a simple and effective method. However, there are several factors affecting the performance of kNN algorithm. One of them is determining an appropriate proximity (distance or similarity) measure. Although the Euclidean distance is often used as a proximity measure in the application of the kNN, studies show that the use of different proximity measures can improve the performance of the kNN. In this study, we propose the Weighted Similarity k-Nearest Neighbors algorithm (WS-kNN) which use a weighted similarity as proximity measure in the kNN algorithm. Firstly, it calculates the weight of each attribute and similarity between the instances in the dataset. And then, it weights similarities by attribute weights and creates a weighted similarity matrix to use as proximity measure. The proposed algorithm is compared with the classical kNN method based on the Euclidean distance. To verify the performance of our algorithm, experiments are made on 10 different real-life datasets from the UCI (UC Irvine Machine Learning Repository) by classification accuracy. Experimental results show that the proposed WS-kNN algorithm can achieve comparative classification accuracy. For some datasets, this new algorithm gives highly good results. In addition, we demonstrated that the use of different proximity measures can affect the classification accuracy of kNN algorithm.</description><issn>1305-130X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNo9j7tOwzAYhT2ARFX6AGx-ARff40woqrhJpQxQlc3y5U9rkSjIDkPenpYilnOWT0fnQ-iG0SUzUuvb4N3kcuthqZmptbxAMyaoIsf4uEKLUpKnQnNplNEzdNfgHaT9YYSI31KfOpfTOOEXcOU7A26HjD_JBlyGMuLNifRDLrjp9sMRPPTX6LJ1XYHFX8_R9uH-ffVE1q-Pz6tmTQITQpIgoqm48ZoDsLpSnNVAvRNKt1FVAgyPngVDXaSn0-CNrEBKgKioDjWIOWLn3ZCHUjK09iun3uXJMmp_xe2_uD2Lix9G21Af</recordid><startdate>20191230</startdate><enddate>20191230</enddate><creator>KARABULUT, Bergen</creator><creator>ARSLAN, Güvenç</creator><creator>ÜNVER, Halil Murat</creator><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0003-0755-1289</orcidid><orcidid>https://orcid.org/0000-0002-4770-2689</orcidid></search><sort><creationdate>20191230</creationdate><title>A Weighted Similarity Measure for k-Nearest Neighbors Algorithm</title><author>KARABULUT, Bergen ; ARSLAN, Güvenç ; ÜNVER, Halil Murat</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1334-c3d8728b62ee1975219e0ba356fd573e82db1c80ad01896eb847e44eed506c9e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>KARABULUT, Bergen</creatorcontrib><creatorcontrib>ARSLAN, Güvenç</creatorcontrib><creatorcontrib>ÜNVER, Halil Murat</creatorcontrib><collection>CrossRef</collection><jtitle>Celal Bayar Üniversitesi Fen Bilimleri Dergisi</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>KARABULUT, Bergen</au><au>ARSLAN, Güvenç</au><au>ÜNVER, Halil Murat</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Weighted Similarity Measure for k-Nearest Neighbors Algorithm</atitle><jtitle>Celal Bayar Üniversitesi Fen Bilimleri Dergisi</jtitle><date>2019-12-30</date><risdate>2019</risdate><volume>15</volume><issue>4</issue><spage>393</spage><epage>400</epage><pages>393-400</pages><issn>1305-130X</issn><abstract>One of the most important problems in machine learning, which has gained importance in recent years, is classification. The k-nearest neighbors (kNN) algorithm is widely used in classification problem because it is a simple and effective method. However, there are several factors affecting the performance of kNN algorithm. One of them is determining an appropriate proximity (distance or similarity) measure. Although the Euclidean distance is often used as a proximity measure in the application of the kNN, studies show that the use of different proximity measures can improve the performance of the kNN. In this study, we propose the Weighted Similarity k-Nearest Neighbors algorithm (WS-kNN) which use a weighted similarity as proximity measure in the kNN algorithm. Firstly, it calculates the weight of each attribute and similarity between the instances in the dataset. And then, it weights similarities by attribute weights and creates a weighted similarity matrix to use as proximity measure. The proposed algorithm is compared with the classical kNN method based on the Euclidean distance. To verify the performance of our algorithm, experiments are made on 10 different real-life datasets from the UCI (UC Irvine Machine Learning Repository) by classification accuracy. Experimental results show that the proposed WS-kNN algorithm can achieve comparative classification accuracy. For some datasets, this new algorithm gives highly good results. In addition, we demonstrated that the use of different proximity measures can affect the classification accuracy of kNN algorithm.</abstract><doi>10.18466/cbayarfbe.618964</doi><tpages>8</tpages><orcidid>https://orcid.org/0000-0003-0755-1289</orcidid><orcidid>https://orcid.org/0000-0002-4770-2689</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1305-130X
ispartof Celal Bayar Üniversitesi Fen Bilimleri Dergisi, 2019-12, Vol.15 (4), p.393-400
issn 1305-130X
language eng
recordid cdi_crossref_primary_10_18466_cbayarfbe_618964
source Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals
title A Weighted Similarity Measure for k-Nearest Neighbors Algorithm
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T20%3A05%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Weighted%20Similarity%20Measure%20for%20k-Nearest%20Neighbors%20Algorithm&rft.jtitle=Celal%20Bayar%20%C3%9Cniversitesi%20Fen%20Bilimleri%20Dergisi&rft.au=KARABULUT,%20Bergen&rft.date=2019-12-30&rft.volume=15&rft.issue=4&rft.spage=393&rft.epage=400&rft.pages=393-400&rft.issn=1305-130X&rft_id=info:doi/10.18466/cbayarfbe.618964&rft_dat=%3Ccrossref%3E10_18466_cbayarfbe_618964%3C/crossref%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true