Clustering applications of IFDBSCAN algorithm with comparative analysis

Density-Based Spatial Clustering of Applications with Noise (DBSCAN) is one of the most preferred algorithms among density-based clustering approaches in unsupervised machine learning; it uses an epsilon-neighborhood construction strategy to discover arbitrarily shaped clusters. DBSCAN separa...

Bibliographic details
Published in:	Journal of intelligent & fuzzy systems 2020-01, Vol.39 (5), p.6099-6108
Main authors:	Unver, Mustafa; Erginel, Nihal
Format:	Article
Language:	eng
Subjects:	Algorithms; Clustering; Computing time; Datasets; Density; Hyperspheres; Machine learning; Outliers (statistics); Parameters; Performance indices
Online access:	Full text

container_end_page	6108
container_issue	5
container_start_page	6099
container_title	Journal of intelligent & fuzzy systems
container_volume	39
creator	Unver, Mustafa; Erginel, Nihal
description	Density-Based Spatial Clustering of Applications with Noise (DBSCAN) is one of the most preferred algorithms among density-based clustering approaches in unsupervised machine learning; it uses an epsilon-neighborhood construction strategy to discover arbitrarily shaped clusters. DBSCAN separates dense regions from low-density regions and simultaneously labels isolated points as outliers, uncovering the hidden cluster patterns in a dataset. DBSCAN identifies dense regions by means of its core point definition, and the detection of core points depends strictly on two input parameters: ε is the neighborhood distance, or the radius of the hypersphere, and MinPts is the minimum density constraint inside the hypersphere of radius ε. In contrast to classical DBSCAN’s crisp core point definition, an intuitionistic fuzzy core point definition was proposed in our preliminary work to make the DBSCAN algorithm capable of detecting different density patterns through two different combinations of input parameters, which is particularly necessary for large datasets of varying density in a multidimensional feature space. This study examines that previously proposed DBSCAN extension, IFDBSCAN. The extension is tested in computational experiments on several real-time datasets from a machine learning repository. Results show that IFDBSCAN is superior to classical DBSCAN with respect to external and internal performance indices, such as the purity index, adjusted Rand index, Fowlkes-Mallows score, silhouette coefficient, and Calinski-Harabasz index, and with respect to the resulting cluster structure, without a large increase in computation time. It also lets users try two different density patterns in the same run and try intermediate density values by manipulating the α margin.
doi_str_mv	10.3233/JIFS-189082
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2463680059</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2463680059</sourcerecordid><originalsourceid>FETCH-LOGICAL-c261t-71271aac775d7d9606fb6ef877d48aafe280c4111ef1e2de568f6494404780a63</originalsourceid><addsrcrecordid>eNotkLFOwzAQhi0EEqUw8QKWGFHgznFsZyyBlqIKhsJsmcQuqZI42Cmob0-qstzd8On0_x8h1wh3KUvT-5flfJ2gykGxEzJBJbNE5UKejjcIniDj4pxcxLgFQJkxmJBF0eziYEPdbajp-6YuzVD7LlLv6HL--LAuZq_UNBsf6uGrpb_jpKVvexNG7sdS05lmH-t4Sc6caaK9-t9T8jF_ei-ek9XbYlnMVknJBA6JRCbRmFLKrJJVLkC4T2GdkrLiyhhnmYKSI6J1aFllM6Gc4DnnwKUCI9IpuTn-7YP_3tk46K3fhTFE1GO5VCiALB-p2yNVBh9jsE73oW5N2GsEfTClD6b00VT6B49FWoQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2463680059</pqid></control><display><type>article</type><title>Clustering applications of IFDBSCAN algorithm with comparative analysis</title><source>Business Source Complete</source><creator>Unver, Mustafa ; Erginel, Nihal</creator><contributor>Kahraman, Cengiz</contributor><creatorcontrib>Unver, Mustafa ; Erginel, Nihal ; Kahraman, Cengiz</creatorcontrib><description>Density Based Spatial Clustering of Application with Noise (DBSCAN) is one of the mostly preferred algorithm among density based clustering approaches in unsupervised machine learning, which uses epsilon neighborhood construction strategy in order to discover arbitrary shaped clusters. DBSCAN separates dense regions from low density regions and simultaneously assigns points that lie alone as outliers to unearth the hidden cluster patterns in the datasets. DBSCAN identifies dense regions by means of core point definition, detection of which are strictly dependent on input parameter definitions: ε is distance of the neighborhood or radius of hypersphere and MinPts is minimum density constraint inside ε radius hypersphere. 
Contrarily to classical DBSCAN’s crisp core point definition, intuitionistic fuzzy core point definition is proposed in our preliminary work to make DBSCAN algorithm capable of detecting different patterns of density by two different combinations of input parameters, particularly is a necessity for the density varying large datasets in multidimensional feature space. In this study, preliminarily proposed DBSCAN extension is studied: IFDBSCAN. The proposed extension is tested by computational experiments on several machine learning repository real-time datasets. Results show that, IFDBSCAN is superior to classical DBSCAN with respect to external & internal performance indices such as purity index, adjusted rand index, Fowlkes-Mallows score, silhouette coefficient, Calinski-Harabasz index and with respect to clustering structure results without increasing computational time so much, along with the possibility of trying two different density patterns on the same run and trying intermediary density values for the users by manipulating α margin.</description><identifier>ISSN: 1064-1246</identifier><identifier>EISSN: 1875-8967</identifier><identifier>DOI: 10.3233/JIFS-189082</identifier><language>eng</language><publisher>Amsterdam: IOS Press BV</publisher><subject>Algorithms ; Clustering ; Computing time ; Datasets ; Density ; Hyperspheres ; Machine learning ; Outliers (statistics) ; Parameters ; Performance indices</subject><ispartof>Journal of intelligent & fuzzy systems, 2020-01, Vol.39 (5), p.6099-6108</ispartof><rights>Copyright IOS Press BV 
2020</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c261t-71271aac775d7d9606fb6ef877d48aafe280c4111ef1e2de568f6494404780a63</citedby><cites>FETCH-LOGICAL-c261t-71271aac775d7d9606fb6ef877d48aafe280c4111ef1e2de568f6494404780a63</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,777,781,27905,27906</link.rule.ids></links><search><contributor>Kahraman, Cengiz</contributor><creatorcontrib>Unver, Mustafa</creatorcontrib><creatorcontrib>Erginel, Nihal</creatorcontrib><title>Clustering applications of IFDBSCAN algorithm with comparative analysis</title><title>Journal of intelligent & fuzzy systems</title><description>Density Based Spatial Clustering of Application with Noise (DBSCAN) is one of the mostly preferred algorithm among density based clustering approaches in unsupervised machine learning, which uses epsilon neighborhood construction strategy in order to discover arbitrary shaped clusters. DBSCAN separates dense regions from low density regions and simultaneously assigns points that lie alone as outliers to unearth the hidden cluster patterns in the datasets. DBSCAN identifies dense regions by means of core point definition, detection of which are strictly dependent on input parameter definitions: ε is distance of the neighborhood or radius of hypersphere and MinPts is minimum density constraint inside ε radius hypersphere. Contrarily to classical DBSCAN’s crisp core point definition, intuitionistic fuzzy core point definition is proposed in our preliminary work to make DBSCAN algorithm capable of detecting different patterns of density by two different combinations of input parameters, particularly is a necessity for the density varying large datasets in multidimensional feature space. 
In this study, preliminarily proposed DBSCAN extension is studied: IFDBSCAN. The proposed extension is tested by computational experiments on several machine learning repository real-time datasets. Results show that, IFDBSCAN is superior to classical DBSCAN with respect to external & internal performance indices such as purity index, adjusted rand index, Fowlkes-Mallows score, silhouette coefficient, Calinski-Harabasz index and with respect to clustering structure results without increasing computational time so much, along with the possibility of trying two different density patterns on the same run and trying intermediary density values for the users by manipulating α margin.</description><subject>Algorithms</subject><subject>Clustering</subject><subject>Computing time</subject><subject>Datasets</subject><subject>Density</subject><subject>Hyperspheres</subject><subject>Machine learning</subject><subject>Outliers (statistics)</subject><subject>Parameters</subject><subject>Performance indices</subject><issn>1064-1246</issn><issn>1875-8967</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNotkLFOwzAQhi0EEqUw8QKWGFHgznFsZyyBlqIKhsJsmcQuqZI42Cmob0-qstzd8On0_x8h1wh3KUvT-5flfJ2gykGxEzJBJbNE5UKejjcIniDj4pxcxLgFQJkxmJBF0eziYEPdbajp-6YuzVD7LlLv6HL--LAuZq_UNBsf6uGrpb_jpKVvexNG7sdS05lmH-t4Sc6caaK9-t9T8jF_ei-ek9XbYlnMVknJBA6JRCbRmFLKrJJVLkC4T2GdkrLiyhhnmYKSI6J1aFllM6Gc4DnnwKUCI9IpuTn-7YP_3tk46K3fhTFE1GO5VCiALB-p2yNVBh9jsE73oW5N2GsEfTClD6b00VT6B49FWoQ</recordid><startdate>20200101</startdate><enddate>20200101</enddate><creator>Unver, Mustafa</creator><creator>Erginel, Nihal</creator><general>IOS Press BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20200101</creationdate><title>Clustering applications of IFDBSCAN algorithm with comparative 
analysis</title><author>Unver, Mustafa ; Erginel, Nihal</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c261t-71271aac775d7d9606fb6ef877d48aafe280c4111ef1e2de568f6494404780a63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Clustering</topic><topic>Computing time</topic><topic>Datasets</topic><topic>Density</topic><topic>Hyperspheres</topic><topic>Machine learning</topic><topic>Outliers (statistics)</topic><topic>Parameters</topic><topic>Performance indices</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Unver, Mustafa</creatorcontrib><creatorcontrib>Erginel, Nihal</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Journal of intelligent & fuzzy systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Unver, Mustafa</au><au>Erginel, Nihal</au><au>Kahraman, Cengiz</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Clustering applications of IFDBSCAN algorithm with comparative analysis</atitle><jtitle>Journal of intelligent & fuzzy systems</jtitle><date>2020-01-01</date><risdate>2020</risdate><volume>39</volume><issue>5</issue><spage>6099</spage><epage>6108</epage><pages>6099-6108</pages><issn>1064-1246</issn><eissn>1875-8967</eissn><abstract>Density Based Spatial Clustering of Application with Noise (DBSCAN) is one of the mostly preferred algorithm among density based 
clustering approaches in unsupervised machine learning, which uses epsilon neighborhood construction strategy in order to discover arbitrary shaped clusters. DBSCAN separates dense regions from low density regions and simultaneously assigns points that lie alone as outliers to unearth the hidden cluster patterns in the datasets. DBSCAN identifies dense regions by means of core point definition, detection of which are strictly dependent on input parameter definitions: ε is distance of the neighborhood or radius of hypersphere and MinPts is minimum density constraint inside ε radius hypersphere. Contrarily to classical DBSCAN’s crisp core point definition, intuitionistic fuzzy core point definition is proposed in our preliminary work to make DBSCAN algorithm capable of detecting different patterns of density by two different combinations of input parameters, particularly is a necessity for the density varying large datasets in multidimensional feature space. In this study, preliminarily proposed DBSCAN extension is studied: IFDBSCAN. The proposed extension is tested by computational experiments on several machine learning repository real-time datasets. Results show that, IFDBSCAN is superior to classical DBSCAN with respect to external & internal performance indices such as purity index, adjusted rand index, Fowlkes-Mallows score, silhouette coefficient, Calinski-Harabasz index and with respect to clustering structure results without increasing computational time so much, along with the possibility of trying two different density patterns on the same run and trying intermediary density values for the users by manipulating α margin.</abstract><cop>Amsterdam</cop><pub>IOS Press BV</pub><doi>10.3233/JIFS-189082</doi><tpages>10</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 1064-1246
ispartof	Journal of intelligent & fuzzy systems, 2020-01, Vol.39 (5), p.6099-6108
issn	1064-1246 1875-8967
language	eng
recordid	cdi_proquest_journals_2463680059
source	Business Source Complete
subjects	Algorithms; Clustering; Computing time; Datasets; Density; Hyperspheres; Machine learning; Outliers (statistics); Parameters; Performance indices
title	Clustering applications of IFDBSCAN algorithm with comparative analysis
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T08%3A02%3A06IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Clustering%20applications%20of%20IFDBSCAN%20algorithm%20with%20comparative%20analysis&rft.jtitle=Journal%20of%20intelligent%20&%20fuzzy%20systems&rft.au=Unver,%20Mustafa&rft.date=2020-01-01&rft.volume=39&rft.issue=5&rft.spage=6099&rft.epage=6108&rft.pages=6099-6108&rft.issn=1064-1246&rft.eissn=1875-8967&rft_id=info:doi/10.3233/JIFS-189082&rft_dat=%3Cproquest_cross%3E2463680059%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2463680059&rft_id=info:pmid/&rfr_iscdi=true
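
The description field above says that classical DBSCAN finds dense regions through a crisp core point test governed by ε and MinPts. As a rough illustration only, that test might be sketched in Python as follows. The function names, the sample points, and the choice to count a point as a member of its own ε-neighborhood are assumptions made for this sketch; the record does not give the formula for the intuitionistic fuzzy core point definition or the α margin, so neither is attempted here.

```python
import math


def region_query(points, i, eps):
    """Indices of all points within distance eps of points[i].

    Assumption: the point itself counts as part of its own neighborhood.
    """
    return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]


def core_points(points, eps, min_pts):
    """Indices of points whose eps-neighborhood holds at least min_pts points."""
    return [
        i
        for i in range(len(points))
        if len(region_query(points, i, eps)) >= min_pts
    ]


# Hypothetical data: four points packed closely together and one isolated point.
points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1), (5.0, 5.0)]

# Crisp test: a point either satisfies the MinPts constraint or it does not.
dense = core_points(points, eps=0.2, min_pts=4)
```

With these made-up values the four tightly packed points pass the test and the isolated point does not. Raising `min_pts` or shrinking `eps` changes which points qualify, which is the parameter sensitivity the description refers to.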