Neighborhood search based improved bat algorithm for data clustering

Clustering is an unsupervised data analytic technique that can determine the similarity between data objects and put the similar data objects into one cluster. The similarity among data objects is determined through some distance function. It is observed that clustering technique gains wide populari...
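
The idea in the summary above (group objects by similarity, with similarity measured by a distance function) can be illustrated with a minimal sketch. This is not the authors' bat algorithm. It only shows, under assumed choices, how candidate cluster centers might be scored. The Euclidean distance, the function names, and the sample points are all illustrative assumptions and do not come from the record.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length points (assumed distance function)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def assign_to_clusters(points, centers, distance=euclidean):
    """Return, for each point, the index of its nearest center."""
    return [
        min(range(len(centers)), key=lambda k: distance(p, centers[k]))
        for p in points
    ]


def intra_cluster_distance(points, centers, labels, distance=euclidean):
    """Sum of distances from each point to the center it is assigned to."""
    return sum(distance(p, centers[k]) for p, k in zip(points, labels))


# Hypothetical toy data: two well-separated groups and two candidate centers.
points = [(1.0, 1.0), (1.0, 2.0), (8.0, 8.0), (9.0, 8.0)]
centers = [(1.0, 1.5), (8.5, 8.0)]

labels = assign_to_clusters(points, centers)
total = intra_cluster_distance(points, centers, labels)
print(labels, total)
```

A center-based clustering method, metaheuristic or otherwise, would typically search for the set of centers that makes a score of this kind as small as possible. The search strategy itself is outside the scope of this sketch.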

Detailed description

Bibliographic details
Published in: Applied intelligence (Dordrecht, Netherlands), 2022-07, Vol.52 (9), p.10541-10575
Main authors: Kaur, Arvinder; Kumar, Yugal
Format: Article
Language: eng
Online access: Full text
container_end_page 10575
container_issue 9
container_start_page 10541
container_title Applied intelligence (Dordrecht, Netherlands)
container_volume 52
creator Kaur, Arvinder
Kumar, Yugal
description Clustering is an unsupervised data analysis technique that measures the similarity between data objects and places similar objects in the same cluster. The similarity among data objects is determined through a distance function. Clustering has gained wide popularity because it is unsupervised, and it is used in diverse research fields such as image segmentation, data analytics, and outlier detection. This work focuses on data clustering problems and proposes a new clustering algorithm based on the behavior of micro-bats. The proposed bat algorithm is used to determine the optimal cluster centers for data clustering problems. Several shortcomings are associated with the bat algorithm, such as a slow convergence rate, local optima, and the trade-off among search mechanisms. The slow convergence issue is addressed through an elitist mechanism, while an enhanced cooperative method is introduced to handle population initialization issues. A Q-learning based neighbourhood search mechanism is also developed to overcome the local optima issue. Several benchmark non-healthcare and healthcare datasets are selected for evaluating the performance of the proposed bat algorithm. The simulation results are evaluated using intra-cluster distance, standard deviation, accuracy, and Rand index, and are compared with nineteen existing meta-heuristic algorithms. The proposed bat algorithm obtains significant results on these datasets.
doi_str_mv 10.1007/s10489-021-02934-x
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2678581201</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2678581201</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-c4a20c973a2e459d9f9c2d53d9dd917e24250131a758918d7c5200746f45bfa83</originalsourceid><addsrcrecordid>eNp9kE1LAzEQhoMoWKt_wFPAczSTj83mKPUTil4UvIVsku1uaZuabKX-e6MrePMwzBzmfWfeB6FzoJdAqbrKQEWtCWVQSnNB9gdoAlJxooRWh2hCNROkqvTbMTrJeUkp5ZzCBN08hX7RNTF1MXqcg02uw43NweN-vU3xowyNHbBdLWLqh26N25iwt4PFbrXLQ0j9ZnGKjlq7yuHst0_R693ty-yBzJ_vH2fXc-I46IE4YRl1WnHLgpDa61Y75iX32nsNKjDBJAUOVslaQ-2Vk6yEE1UrZNPamk_RxehbHnvfhTyYZdylTTlpWKVqWQMr-ili45ZLMecUWrNN_dqmTwPUfNMyIy1TaJkfWmZfRHwU5e13opD-rP9RfQFSK2z9</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2678581201</pqid></control><display><type>article</type><title>Neighborhood search based improved bat algorithm for data clustering</title><source>Springer Nature - Complete Springer Journals</source><creator>Kaur, Arvinder ; Kumar, Yugal</creator><creatorcontrib>Kaur, Arvinder ; Kumar, Yugal</creatorcontrib><description>Clustering is an unsupervised data analytic technique that can determine the similarity between data objects and put the similar data objects into one cluster. The similarity among data objects is determined through some distance function. It is observed that clustering technique gains wide popularity due to its unsupervised and can be used in diverse research filed such as image segmentation, data analytics, outlier detection, and so on. This work focuses on the data clustering problems and proposes a new clustering algorithm based on the behavior of micro-bats. The proposed bat algorithm to determine the optimal cluster center for data clustering problems. 
It is also observed that several shortcomings are associated with bat algorithm such as slow convergence rate, local optima, and trade-off among search mechanisms. The slow convergence issue is addressed through an elitist mechanism. While an enhanced cooperative method is introduced for handling population initialization issues. In this work, a Q-learning based neighbourhood search mechanism is also developed to effectively overcome the local optima issue. Several benchmark non-healthcare and healthcare datasets are selected for evaluating the performance of the proposed bat algorithm. The simulation results are evaluated using intracluster distance, standard deviation, accuracy, and rand index parameters and compared with nineteen existing meta-heuristic algorithms. It is observed that the proposed bat algorithm obtains significant results with these datasets.</description><identifier>ISSN: 0924-669X</identifier><identifier>EISSN: 1573-7497</identifier><identifier>DOI: 10.1007/s10489-021-02934-x</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Algorithms ; Artificial Intelligence ; Clustering ; Computer Science ; Convergence ; Data analysis ; Datasets ; Health care ; Heuristic methods ; Image segmentation ; Machine learning ; Machines ; Manufacturing ; Mechanical Engineering ; Outliers (statistics) ; Performance evaluation ; Processes ; Searching ; Similarity</subject><ispartof>Applied intelligence (Dordrecht, Netherlands), 2022-07, Vol.52 (9), p.10541-10575</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022</rights><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 
2022.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-c4a20c973a2e459d9f9c2d53d9dd917e24250131a758918d7c5200746f45bfa83</citedby><cites>FETCH-LOGICAL-c319t-c4a20c973a2e459d9f9c2d53d9dd917e24250131a758918d7c5200746f45bfa83</cites><orcidid>0000-0003-3451-4897</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s10489-021-02934-x$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s10489-021-02934-x$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27903,27904,41467,42536,51297</link.rule.ids></links><search><creatorcontrib>Kaur, Arvinder</creatorcontrib><creatorcontrib>Kumar, Yugal</creatorcontrib><title>Neighborhood search based improved bat algorithm for data clustering</title><title>Applied intelligence (Dordrecht, Netherlands)</title><addtitle>Appl Intell</addtitle><description>Clustering is an unsupervised data analytic technique that can determine the similarity between data objects and put the similar data objects into one cluster. The similarity among data objects is determined through some distance function. It is observed that clustering technique gains wide popularity due to its unsupervised and can be used in diverse research filed such as image segmentation, data analytics, outlier detection, and so on. This work focuses on the data clustering problems and proposes a new clustering algorithm based on the behavior of micro-bats. The proposed bat algorithm to determine the optimal cluster center for data clustering problems. It is also observed that several shortcomings are associated with bat algorithm such as slow convergence rate, local optima, and trade-off among search mechanisms. The slow convergence issue is addressed through an elitist mechanism. 
While an enhanced cooperative method is introduced for handling population initialization issues. In this work, a Q-learning based neighbourhood search mechanism is also developed to effectively overcome the local optima issue. Several benchmark non-healthcare and healthcare datasets are selected for evaluating the performance of the proposed bat algorithm. The simulation results are evaluated using intracluster distance, standard deviation, accuracy, and rand index parameters and compared with nineteen existing meta-heuristic algorithms. It is observed that the proposed bat algorithm obtains significant results with these datasets.</description><subject>Algorithms</subject><subject>Artificial Intelligence</subject><subject>Clustering</subject><subject>Computer Science</subject><subject>Convergence</subject><subject>Data analysis</subject><subject>Datasets</subject><subject>Health care</subject><subject>Heuristic methods</subject><subject>Image segmentation</subject><subject>Machine learning</subject><subject>Machines</subject><subject>Manufacturing</subject><subject>Mechanical Engineering</subject><subject>Outliers (statistics)</subject><subject>Performance 
evaluation</subject><subject>Processes</subject><subject>Searching</subject><subject>Similarity</subject><issn>0924-669X</issn><issn>1573-7497</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp9kE1LAzEQhoMoWKt_wFPAczSTj83mKPUTil4UvIVsku1uaZuabKX-e6MrePMwzBzmfWfeB6FzoJdAqbrKQEWtCWVQSnNB9gdoAlJxooRWh2hCNROkqvTbMTrJeUkp5ZzCBN08hX7RNTF1MXqcg02uw43NweN-vU3xowyNHbBdLWLqh26N25iwt4PFbrXLQ0j9ZnGKjlq7yuHst0_R693ty-yBzJ_vH2fXc-I46IE4YRl1WnHLgpDa61Y75iX32nsNKjDBJAUOVslaQ-2Vk6yEE1UrZNPamk_RxehbHnvfhTyYZdylTTlpWKVqWQMr-ili45ZLMecUWrNN_dqmTwPUfNMyIy1TaJkfWmZfRHwU5e13opD-rP9RfQFSK2z9</recordid><startdate>20220701</startdate><enddate>20220701</enddate><creator>Kaur, Arvinder</creator><creator>Kumar, Yugal</creator><general>Springer US</general><general>Springer Nature 
B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L6V</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PSYQQ</scope><scope>PTHSS</scope><scope>Q9U</scope><orcidid>https://orcid.org/0000-0003-3451-4897</orcidid></search><sort><creationdate>20220701</creationdate><title>Neighborhood search based improved bat algorithm for data clustering</title><author>Kaur, Arvinder ; Kumar, Yugal</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-c4a20c973a2e459d9f9c2d53d9dd917e24250131a758918d7c5200746f45bfa83</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Algorithms</topic><topic>Artificial Intelligence</topic><topic>Clustering</topic><topic>Computer Science</topic><topic>Convergence</topic><topic>Data analysis</topic><topic>Datasets</topic><topic>Health care</topic><topic>Heuristic methods</topic><topic>Image segmentation</topic><topic>Machine learning</topic><topic>Machines</topic><topic>Manufacturing</topic><topic>Mechanical Engineering</topic><topic>Outliers (statistics)</topic><topic>Performance 
evaluation</topic><topic>Processes</topic><topic>Searching</topic><topic>Similarity</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kaur, Arvinder</creatorcontrib><creatorcontrib>Kumar, Yugal</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Global (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer Science 
Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>ProQuest Engineering Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>Engineering Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Business</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest One Psychology</collection><collection>Engineering Collection</collection><collection>ProQuest Central Basic</collection><jtitle>Applied intelligence (Dordrecht, Netherlands)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kaur, Arvinder</au><au>Kumar, Yugal</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Neighborhood search based improved bat algorithm for data clustering</atitle><jtitle>Applied intelligence (Dordrecht, Netherlands)</jtitle><stitle>Appl Intell</stitle><date>2022-07-01</date><risdate>2022</risdate><volume>52</volume><issue>9</issue><spage>10541</spage><epage>10575</epage><pages>10541-10575</pages><issn>0924-669X</issn><eissn>1573-7497</eissn><abstract>Clustering is an unsupervised data analytic technique that can determine the similarity between data objects and put the similar data objects into one cluster. 
The similarity among data objects is determined through some distance function. It is observed that clustering technique gains wide popularity due to its unsupervised and can be used in diverse research filed such as image segmentation, data analytics, outlier detection, and so on. This work focuses on the data clustering problems and proposes a new clustering algorithm based on the behavior of micro-bats. The proposed bat algorithm to determine the optimal cluster center for data clustering problems. It is also observed that several shortcomings are associated with bat algorithm such as slow convergence rate, local optima, and trade-off among search mechanisms. The slow convergence issue is addressed through an elitist mechanism. While an enhanced cooperative method is introduced for handling population initialization issues. In this work, a Q-learning based neighbourhood search mechanism is also developed to effectively overcome the local optima issue. Several benchmark non-healthcare and healthcare datasets are selected for evaluating the performance of the proposed bat algorithm. The simulation results are evaluated using intracluster distance, standard deviation, accuracy, and rand index parameters and compared with nineteen existing meta-heuristic algorithms. It is observed that the proposed bat algorithm obtains significant results with these datasets.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s10489-021-02934-x</doi><tpages>35</tpages><orcidid>https://orcid.org/0000-0003-3451-4897</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0924-669X
ispartof Applied intelligence (Dordrecht, Netherlands), 2022-07, Vol.52 (9), p.10541-10575
issn 0924-669X
1573-7497
language eng
recordid cdi_proquest_journals_2678581201
source Springer Nature - Complete Springer Journals
subjects Algorithms
Artificial Intelligence
Clustering
Computer Science
Convergence
Data analysis
Datasets
Health care
Heuristic methods
Image segmentation
Machine learning
Machines
Manufacturing
Mechanical Engineering
Outliers (statistics)
Performance evaluation
Processes
Searching
Similarity
title Neighborhood search based improved bat algorithm for data clustering
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-25T05%3A17%3A45IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Neighborhood%20search%20based%20improved%20bat%20algorithm%20for%20data%20clustering&rft.jtitle=Applied%20intelligence%20(Dordrecht,%20Netherlands)&rft.au=Kaur,%20Arvinder&rft.date=2022-07-01&rft.volume=52&rft.issue=9&rft.spage=10541&rft.epage=10575&rft.pages=10541-10575&rft.issn=0924-669X&rft.eissn=1573-7497&rft_id=info:doi/10.1007/s10489-021-02934-x&rft_dat=%3Cproquest_cross%3E2678581201%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2678581201&rft_id=info:pmid/&rfr_iscdi=true