Clustering applications of IFDBSCAN algorithm with comparative analysis

Density Based Spatial Clustering of Application with Noise (DBSCAN) is one of the mostly preferred algorithm among density based clustering approaches in unsupervised machine learning, which uses epsilon neighborhood construction strategy in order to discover arbitrary shaped clusters. DBSCAN separa...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of intelligent & fuzzy systems 2020-01, Vol.39 (5), p.6099-6108
Hauptverfasser: Unver, Mustafa, Erginel, Nihal
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 6108
container_issue 5
container_start_page 6099
container_title Journal of intelligent & fuzzy systems
container_volume 39
creator Unver, Mustafa
Erginel, Nihal
description Density Based Spatial Clustering of Application with Noise (DBSCAN) is one of the mostly preferred algorithm among density based clustering approaches in unsupervised machine learning, which uses epsilon neighborhood construction strategy in order to discover arbitrary shaped clusters. DBSCAN separates dense regions from low density regions and simultaneously assigns points that lie alone as outliers to unearth the hidden cluster patterns in the datasets. DBSCAN identifies dense regions by means of core point definition, detection of which are strictly dependent on input parameter definitions: ε is distance of the neighborhood or radius of hypersphere and MinPts is minimum density constraint inside ε radius hypersphere. Contrarily to classical DBSCAN’s crisp core point definition, intuitionistic fuzzy core point definition is proposed in our preliminary work to make DBSCAN algorithm capable of detecting different patterns of density by two different combinations of input parameters, particularly is a necessity for the density varying large datasets in multidimensional feature space. In this study, preliminarily proposed DBSCAN extension is studied: IFDBSCAN. The proposed extension is tested by computational experiments on several machine learning repository real-time datasets. Results show that, IFDBSCAN is superior to classical DBSCAN with respect to external & internal performance indices such as purity index, adjusted rand index, Fowlkes-Mallows score, silhouette coefficient, Calinski-Harabasz index and with respect to clustering structure results without increasing computational time so much, along with the possibility of trying two different density patterns on the same run and trying intermediary density values for the users by manipulating α margin.
doi_str_mv 10.3233/JIFS-189082
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2463680059</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2463680059</sourcerecordid><originalsourceid>FETCH-LOGICAL-c261t-71271aac775d7d9606fb6ef877d48aafe280c4111ef1e2de568f6494404780a63</originalsourceid><addsrcrecordid>eNotkLFOwzAQhi0EEqUw8QKWGFHgznFsZyyBlqIKhsJsmcQuqZI42Cmob0-qstzd8On0_x8h1wh3KUvT-5flfJ2gykGxEzJBJbNE5UKejjcIniDj4pxcxLgFQJkxmJBF0eziYEPdbajp-6YuzVD7LlLv6HL--LAuZq_UNBsf6uGrpb_jpKVvexNG7sdS05lmH-t4Sc6caaK9-t9T8jF_ei-ek9XbYlnMVknJBA6JRCbRmFLKrJJVLkC4T2GdkrLiyhhnmYKSI6J1aFllM6Gc4DnnwKUCI9IpuTn-7YP_3tk46K3fhTFE1GO5VCiALB-p2yNVBh9jsE73oW5N2GsEfTClD6b00VT6B49FWoQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2463680059</pqid></control><display><type>article</type><title>Clustering applications of IFDBSCAN algorithm with comparative analysis</title><source>Business Source Complete</source><creator>Unver, Mustafa ; Erginel, Nihal</creator><contributor>Kahraman, Cengiz</contributor><creatorcontrib>Unver, Mustafa ; Erginel, Nihal ; Kahraman, Cengiz</creatorcontrib><description>Density Based Spatial Clustering of Application with Noise (DBSCAN) is one of the mostly preferred algorithm among density based clustering approaches in unsupervised machine learning, which uses epsilon neighborhood construction strategy in order to discover arbitrary shaped clusters. DBSCAN separates dense regions from low density regions and simultaneously assigns points that lie alone as outliers to unearth the hidden cluster patterns in the datasets. DBSCAN identifies dense regions by means of core point definition, detection of which are strictly dependent on input parameter definitions: ε is distance of the neighborhood or radius of hypersphere and MinPts is minimum density constraint inside ε radius hypersphere. Contrarily to classical DBSCAN’s crisp core point definition, intuitionistic fuzzy core point definition is proposed in our preliminary work to make DBSCAN algorithm capable of detecting different patterns of density by two different combinations of input parameters, particularly is a necessity for the density varying large datasets in multidimensional feature space. In this study, preliminarily proposed DBSCAN extension is studied: IFDBSCAN. The proposed extension is tested by computational experiments on several machine learning repository real-time datasets. Results show that, IFDBSCAN is superior to classical DBSCAN with respect to external &amp; internal performance indices such as purity index, adjusted rand index, Fowlkes-Mallows score, silhouette coefficient, Calinski-Harabasz index and with respect to clustering structure results without increasing computational time so much, along with the possibility of trying two different density patterns on the same run and trying intermediary density values for the users by manipulating α margin.</description><identifier>ISSN: 1064-1246</identifier><identifier>EISSN: 1875-8967</identifier><identifier>DOI: 10.3233/JIFS-189082</identifier><language>eng</language><publisher>Amsterdam: IOS Press BV</publisher><subject>Algorithms ; Clustering ; Computing time ; Datasets ; Density ; Hyperspheres ; Machine learning ; Outliers (statistics) ; Parameters ; Performance indices</subject><ispartof>Journal of intelligent &amp; fuzzy systems, 2020-01, Vol.39 (5), p.6099-6108</ispartof><rights>Copyright IOS Press BV 2020</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c261t-71271aac775d7d9606fb6ef877d48aafe280c4111ef1e2de568f6494404780a63</citedby><cites>FETCH-LOGICAL-c261t-71271aac775d7d9606fb6ef877d48aafe280c4111ef1e2de568f6494404780a63</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,777,781,27905,27906</link.rule.ids></links><search><contributor>Kahraman, Cengiz</contributor><creatorcontrib>Unver, Mustafa</creatorcontrib><creatorcontrib>Erginel, Nihal</creatorcontrib><title>Clustering applications of IFDBSCAN algorithm with comparative analysis</title><title>Journal of intelligent &amp; fuzzy systems</title><description>Density Based Spatial Clustering of Application with Noise (DBSCAN) is one of the mostly preferred algorithm among density based clustering approaches in unsupervised machine learning, which uses epsilon neighborhood construction strategy in order to discover arbitrary shaped clusters. DBSCAN separates dense regions from low density regions and simultaneously assigns points that lie alone as outliers to unearth the hidden cluster patterns in the datasets. DBSCAN identifies dense regions by means of core point definition, detection of which are strictly dependent on input parameter definitions: ε is distance of the neighborhood or radius of hypersphere and MinPts is minimum density constraint inside ε radius hypersphere. Contrarily to classical DBSCAN’s crisp core point definition, intuitionistic fuzzy core point definition is proposed in our preliminary work to make DBSCAN algorithm capable of detecting different patterns of density by two different combinations of input parameters, particularly is a necessity for the density varying large datasets in multidimensional feature space. In this study, preliminarily proposed DBSCAN extension is studied: IFDBSCAN. The proposed extension is tested by computational experiments on several machine learning repository real-time datasets. Results show that, IFDBSCAN is superior to classical DBSCAN with respect to external &amp; internal performance indices such as purity index, adjusted rand index, Fowlkes-Mallows score, silhouette coefficient, Calinski-Harabasz index and with respect to clustering structure results without increasing computational time so much, along with the possibility of trying two different density patterns on the same run and trying intermediary density values for the users by manipulating α margin.</description><subject>Algorithms</subject><subject>Clustering</subject><subject>Computing time</subject><subject>Datasets</subject><subject>Density</subject><subject>Hyperspheres</subject><subject>Machine learning</subject><subject>Outliers (statistics)</subject><subject>Parameters</subject><subject>Performance indices</subject><issn>1064-1246</issn><issn>1875-8967</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNotkLFOwzAQhi0EEqUw8QKWGFHgznFsZyyBlqIKhsJsmcQuqZI42Cmob0-qstzd8On0_x8h1wh3KUvT-5flfJ2gykGxEzJBJbNE5UKejjcIniDj4pxcxLgFQJkxmJBF0eziYEPdbajp-6YuzVD7LlLv6HL--LAuZq_UNBsf6uGrpb_jpKVvexNG7sdS05lmH-t4Sc6caaK9-t9T8jF_ei-ek9XbYlnMVknJBA6JRCbRmFLKrJJVLkC4T2GdkrLiyhhnmYKSI6J1aFllM6Gc4DnnwKUCI9IpuTn-7YP_3tk46K3fhTFE1GO5VCiALB-p2yNVBh9jsE73oW5N2GsEfTClD6b00VT6B49FWoQ</recordid><startdate>20200101</startdate><enddate>20200101</enddate><creator>Unver, Mustafa</creator><creator>Erginel, Nihal</creator><general>IOS Press BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20200101</creationdate><title>Clustering applications of IFDBSCAN algorithm with comparative analysis</title><author>Unver, Mustafa ; Erginel, Nihal</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c261t-71271aac775d7d9606fb6ef877d48aafe280c4111ef1e2de568f6494404780a63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Clustering</topic><topic>Computing time</topic><topic>Datasets</topic><topic>Density</topic><topic>Hyperspheres</topic><topic>Machine learning</topic><topic>Outliers (statistics)</topic><topic>Parameters</topic><topic>Performance indices</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Unver, Mustafa</creatorcontrib><creatorcontrib>Erginel, Nihal</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Journal of intelligent &amp; fuzzy systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Unver, Mustafa</au><au>Erginel, Nihal</au><au>Kahraman, Cengiz</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Clustering applications of IFDBSCAN algorithm with comparative analysis</atitle><jtitle>Journal of intelligent &amp; fuzzy systems</jtitle><date>2020-01-01</date><risdate>2020</risdate><volume>39</volume><issue>5</issue><spage>6099</spage><epage>6108</epage><pages>6099-6108</pages><issn>1064-1246</issn><eissn>1875-8967</eissn><abstract>Density Based Spatial Clustering of Application with Noise (DBSCAN) is one of the mostly preferred algorithm among density based clustering approaches in unsupervised machine learning, which uses epsilon neighborhood construction strategy in order to discover arbitrary shaped clusters. DBSCAN separates dense regions from low density regions and simultaneously assigns points that lie alone as outliers to unearth the hidden cluster patterns in the datasets. DBSCAN identifies dense regions by means of core point definition, detection of which are strictly dependent on input parameter definitions: ε is distance of the neighborhood or radius of hypersphere and MinPts is minimum density constraint inside ε radius hypersphere. Contrarily to classical DBSCAN’s crisp core point definition, intuitionistic fuzzy core point definition is proposed in our preliminary work to make DBSCAN algorithm capable of detecting different patterns of density by two different combinations of input parameters, particularly is a necessity for the density varying large datasets in multidimensional feature space. In this study, preliminarily proposed DBSCAN extension is studied: IFDBSCAN. The proposed extension is tested by computational experiments on several machine learning repository real-time datasets. Results show that, IFDBSCAN is superior to classical DBSCAN with respect to external &amp; internal performance indices such as purity index, adjusted rand index, Fowlkes-Mallows score, silhouette coefficient, Calinski-Harabasz index and with respect to clustering structure results without increasing computational time so much, along with the possibility of trying two different density patterns on the same run and trying intermediary density values for the users by manipulating α margin.</abstract><cop>Amsterdam</cop><pub>IOS Press BV</pub><doi>10.3233/JIFS-189082</doi><tpages>10</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1064-1246
ispartof Journal of intelligent & fuzzy systems, 2020-01, Vol.39 (5), p.6099-6108
issn 1064-1246
1875-8967
language eng
recordid cdi_proquest_journals_2463680059
source Business Source Complete
subjects Algorithms
Clustering
Computing time
Datasets
Density
Hyperspheres
Machine learning
Outliers (statistics)
Parameters
Performance indices
title Clustering applications of IFDBSCAN algorithm with comparative analysis
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T08%3A02%3A06IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Clustering%20applications%20of%20IFDBSCAN%20algorithm%20with%20comparative%20analysis&rft.jtitle=Journal%20of%20intelligent%20&%20fuzzy%20systems&rft.au=Unver,%20Mustafa&rft.date=2020-01-01&rft.volume=39&rft.issue=5&rft.spage=6099&rft.epage=6108&rft.pages=6099-6108&rft.issn=1064-1246&rft.eissn=1875-8967&rft_id=info:doi/10.3233/JIFS-189082&rft_dat=%3Cproquest_cross%3E2463680059%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2463680059&rft_id=info:pmid/&rfr_iscdi=true