A Weighted Similarity Measure for k-Nearest Neighbors Algorithm
One of the most important problems in machine learning, which has gained importance in recent years, is classification. The k-nearest neighbors (kNN) algorithm is widely used in classification problem because it is a simple and effective method. However, there are several factors affecting the perfo...
Gespeichert in:
Veröffentlicht in: | Celal Bayar Üniversitesi Fen Bilimleri Dergisi 2019-12, Vol.15 (4), p.393-400 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 400 |
---|---|
container_issue | 4 |
container_start_page | 393 |
container_title | Celal Bayar Üniversitesi Fen Bilimleri Dergisi |
container_volume | 15 |
creator | KARABULUT, Bergen ARSLAN, Güvenç ÜNVER, Halil Murat |
description | One of the most important problems in machine
learning, which has gained importance in recent years, is classification. The
k-nearest neighbors (kNN) algorithm is widely used in classification problem
because it is a simple and effective method. However, there are several factors
affecting the performance of kNN algorithm. One of them is determining an
appropriate proximity (distance or similarity) measure. Although the Euclidean
distance is often used as a proximity measure in the application of the kNN,
studies show that the use of different proximity measures can improve the performance
of the kNN. In this study, we propose the Weighted Similarity k-Nearest
Neighbors algorithm (WS-kNN) which use a weighted
similarity as proximity measure in the kNN algorithm. Firstly, it
calculates the weight of each attribute and similarity between the instances in
the dataset. And then, it weights similarities by attribute weights and creates
a weighted similarity matrix to use as proximity measure. The proposed
algorithm is compared with the classical kNN method based on the Euclidean
distance. To verify the performance of our algorithm, experiments are made on
10 different real-life datasets from the UCI (UC Irvine Machine Learning
Repository) by classification accuracy. Experimental results show that the
proposed WS-kNN algorithm can achieve comparative classification accuracy. For
some datasets, this new algorithm gives highly good results. In addition, we demonstrated
that the use of different proximity measures can affect the classification
accuracy of kNN algorithm. |
doi_str_mv | 10.18466/cbayarfbe.618964 |
format | Article |
fullrecord | <record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_18466_cbayarfbe_618964</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_18466_cbayarfbe_618964</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1334-c3d8728b62ee1975219e0ba356fd573e82db1c80ad01896eb847e44eed506c9e3</originalsourceid><addsrcrecordid>eNo9j7tOwzAYhT2ARFX6AGx-ARff40woqrhJpQxQlc3y5U9rkSjIDkPenpYilnOWT0fnQ-iG0SUzUuvb4N3kcuthqZmptbxAMyaoIsf4uEKLUpKnQnNplNEzdNfgHaT9YYSI31KfOpfTOOEXcOU7A26HjD_JBlyGMuLNifRDLrjp9sMRPPTX6LJ1XYHFX8_R9uH-ffVE1q-Pz6tmTQITQpIgoqm48ZoDsLpSnNVAvRNKt1FVAgyPngVDXaSn0-CNrEBKgKioDjWIOWLn3ZCHUjK09iun3uXJMmp_xe2_uD2Lix9G21Af</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>A Weighted Similarity Measure for k-Nearest Neighbors Algorithm</title><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>KARABULUT, Bergen ; ARSLAN, Güvenç ; ÜNVER, Halil Murat</creator><creatorcontrib>KARABULUT, Bergen ; ARSLAN, Güvenç ; ÜNVER, Halil Murat</creatorcontrib><description>One of the most important problems in machine
learning, which has gained importance in recent years, is classification. The
k-nearest neighbors (kNN) algorithm is widely used in classification problem
because it is a simple and effective method. However, there are several factors
affecting the performance of kNN algorithm. One of them is determining an
appropriate proximity (distance or similarity) measure. Although the Euclidean
distance is often used as a proximity measure in the application of the kNN,
studies show that the use of different proximity measures can improve the performance
of the kNN. In this study, we propose the Weighted Similarity k-Nearest
Neighbors algorithm (WS-kNN) which use a weighted
similarity as proximity measure in the kNN algorithm. Firstly, it
calculates the weight of each attribute and similarity between the instances in
the dataset. And then, it weights similarities by attribute weights and creates
a weighted similarity matrix to use as proximity measure. The proposed
algorithm is compared with the classical kNN method based on the Euclidean
distance. To verify the performance of our algorithm, experiments are made on
10 different real-life datasets from the UCI (UC Irvine Machine Learning
Repository) by classification accuracy. Experimental results show that the
proposed WS-kNN algorithm can achieve comparative classification accuracy. For
some datasets, this new algorithm gives highly good results. In addition, we demonstrated
that the use of different proximity measures can affect the classification
accuracy of kNN algorithm.</description><identifier>ISSN: 1305-130X</identifier><identifier>DOI: 10.18466/cbayarfbe.618964</identifier><language>eng</language><ispartof>Celal Bayar Üniversitesi Fen Bilimleri Dergisi, 2019-12, Vol.15 (4), p.393-400</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c1334-c3d8728b62ee1975219e0ba356fd573e82db1c80ad01896eb847e44eed506c9e3</citedby><cites>FETCH-LOGICAL-c1334-c3d8728b62ee1975219e0ba356fd573e82db1c80ad01896eb847e44eed506c9e3</cites><orcidid>0000-0003-0755-1289 ; 0000-0002-4770-2689</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>KARABULUT, Bergen</creatorcontrib><creatorcontrib>ARSLAN, Güvenç</creatorcontrib><creatorcontrib>ÜNVER, Halil Murat</creatorcontrib><title>A Weighted Similarity Measure for k-Nearest Neighbors Algorithm</title><title>Celal Bayar Üniversitesi Fen Bilimleri Dergisi</title><description>One of the most important problems in machine
learning, which has gained importance in recent years, is classification. The
k-nearest neighbors (kNN) algorithm is widely used in classification problem
because it is a simple and effective method. However, there are several factors
affecting the performance of kNN algorithm. One of them is determining an
appropriate proximity (distance or similarity) measure. Although the Euclidean
distance is often used as a proximity measure in the application of the kNN,
studies show that the use of different proximity measures can improve the performance
of the kNN. In this study, we propose the Weighted Similarity k-Nearest
Neighbors algorithm (WS-kNN) which use a weighted
similarity as proximity measure in the kNN algorithm. Firstly, it
calculates the weight of each attribute and similarity between the instances in
the dataset. And then, it weights similarities by attribute weights and creates
a weighted similarity matrix to use as proximity measure. The proposed
algorithm is compared with the classical kNN method based on the Euclidean
distance. To verify the performance of our algorithm, experiments are made on
10 different real-life datasets from the UCI (UC Irvine Machine Learning
Repository) by classification accuracy. Experimental results show that the
proposed WS-kNN algorithm can achieve comparative classification accuracy. For
some datasets, this new algorithm gives highly good results. In addition, we demonstrated
that the use of different proximity measures can affect the classification
accuracy of kNN algorithm.</description><issn>1305-130X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNo9j7tOwzAYhT2ARFX6AGx-ARff40woqrhJpQxQlc3y5U9rkSjIDkPenpYilnOWT0fnQ-iG0SUzUuvb4N3kcuthqZmptbxAMyaoIsf4uEKLUpKnQnNplNEzdNfgHaT9YYSI31KfOpfTOOEXcOU7A26HjD_JBlyGMuLNifRDLrjp9sMRPPTX6LJ1XYHFX8_R9uH-ffVE1q-Pz6tmTQITQpIgoqm48ZoDsLpSnNVAvRNKt1FVAgyPngVDXaSn0-CNrEBKgKioDjWIOWLn3ZCHUjK09iun3uXJMmp_xe2_uD2Lix9G21Af</recordid><startdate>20191230</startdate><enddate>20191230</enddate><creator>KARABULUT, Bergen</creator><creator>ARSLAN, Güvenç</creator><creator>ÜNVER, Halil Murat</creator><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0003-0755-1289</orcidid><orcidid>https://orcid.org/0000-0002-4770-2689</orcidid></search><sort><creationdate>20191230</creationdate><title>A Weighted Similarity Measure for k-Nearest Neighbors Algorithm</title><author>KARABULUT, Bergen ; ARSLAN, Güvenç ; ÜNVER, Halil Murat</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1334-c3d8728b62ee1975219e0ba356fd573e82db1c80ad01896eb847e44eed506c9e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>KARABULUT, Bergen</creatorcontrib><creatorcontrib>ARSLAN, Güvenç</creatorcontrib><creatorcontrib>ÜNVER, Halil Murat</creatorcontrib><collection>CrossRef</collection><jtitle>Celal Bayar Üniversitesi Fen Bilimleri Dergisi</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>KARABULUT, Bergen</au><au>ARSLAN, Güvenç</au><au>ÜNVER, Halil Murat</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Weighted Similarity Measure for k-Nearest Neighbors Algorithm</atitle><jtitle>Celal Bayar Üniversitesi Fen Bilimleri Dergisi</jtitle><date>2019-12-30</date><risdate>2019</risdate><volume>15</volume><issue>4</issue><spage>393</spage><epage>400</epage><pages>393-400</pages><issn>1305-130X</issn><abstract>One of the most important problems in machine
learning, which has gained importance in recent years, is classification. The
k-nearest neighbors (kNN) algorithm is widely used in classification problem
because it is a simple and effective method. However, there are several factors
affecting the performance of kNN algorithm. One of them is determining an
appropriate proximity (distance or similarity) measure. Although the Euclidean
distance is often used as a proximity measure in the application of the kNN,
studies show that the use of different proximity measures can improve the performance
of the kNN. In this study, we propose the Weighted Similarity k-Nearest
Neighbors algorithm (WS-kNN) which use a weighted
similarity as proximity measure in the kNN algorithm. Firstly, it
calculates the weight of each attribute and similarity between the instances in
the dataset. And then, it weights similarities by attribute weights and creates
a weighted similarity matrix to use as proximity measure. The proposed
algorithm is compared with the classical kNN method based on the Euclidean
distance. To verify the performance of our algorithm, experiments are made on
10 different real-life datasets from the UCI (UC Irvine Machine Learning
Repository) by classification accuracy. Experimental results show that the
proposed WS-kNN algorithm can achieve comparative classification accuracy. For
some datasets, this new algorithm gives highly good results. In addition, we demonstrated
that the use of different proximity measures can affect the classification
accuracy of kNN algorithm.</abstract><doi>10.18466/cbayarfbe.618964</doi><tpages>8</tpages><orcidid>https://orcid.org/0000-0003-0755-1289</orcidid><orcidid>https://orcid.org/0000-0002-4770-2689</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1305-130X |
ispartof | Celal Bayar Üniversitesi Fen Bilimleri Dergisi, 2019-12, Vol.15 (4), p.393-400 |
issn | 1305-130X |
language | eng |
recordid | cdi_crossref_primary_10_18466_cbayarfbe_618964 |
source | Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals |
title | A Weighted Similarity Measure for k-Nearest Neighbors Algorithm |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T20%3A05%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Weighted%20Similarity%20Measure%20for%20k-Nearest%20Neighbors%20Algorithm&rft.jtitle=Celal%20Bayar%20%C3%9Cniversitesi%20Fen%20Bilimleri%20Dergisi&rft.au=KARABULUT,%20Bergen&rft.date=2019-12-30&rft.volume=15&rft.issue=4&rft.spage=393&rft.epage=400&rft.pages=393-400&rft.issn=1305-130X&rft_id=info:doi/10.18466/cbayarfbe.618964&rft_dat=%3Ccrossref%3E10_18466_cbayarfbe_618964%3C/crossref%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |