A Novel Hybridization of Expectation-Maximization and K-Means Algorithms for Better Clustering Performance

Expectation Maximization (EM) is a widely used mixture-model-based data clustering algorithm that produces exceptionally good results. However, many researchers have reported that the EM algorithm requires more computational effort than other clustering algorithms. This paper presents an algorithm fo...

Detailed description

Bibliographic details
Published in: International journal of ambient computing and intelligence, 2016-07, Vol. 7 (2), p. 47-74
Main authors: Kishor, Duggirala Raja; Venkateswarlu, N.B
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page 74
container_issue 2
container_start_page 47
container_title International journal of ambient computing and intelligence
container_volume 7
creator Kishor, Duggirala Raja
Venkateswarlu, N.B
description Expectation Maximization (EM) is a widely used mixture-model-based data clustering algorithm that produces exceptionally good results. However, many researchers have reported that the EM algorithm requires more computational effort than other clustering algorithms. This paper presents a novel hybridization of the EM and K-Means techniques for achieving better clustering performance (NovHbEMKM). The algorithm first performs K-Means and then, using these results, performs EM and K-Means in alternate iterations. Along with NovHbEMKM, experiments are carried out with EM, EM using the results of K-Means, and the Cluster package of Purdue University. The experiments use datasets from the UCI ML repository and synthetic datasets. Execution time, Clustering Fitness, and Sum of Squared Errors (SSE) are computed as performance criteria. In all the experiments, the proposed NovHbEMKM algorithm takes less execution time while producing results with higher clustering fitness and lower SSE than the other algorithms, including the Cluster package.
doi_str_mv 10.4018/IJACI.2016070103
format Article
fullrecord <record><control><sourceid>gale_cross</sourceid><recordid>TN_cdi_gale_businessinsightsgauss_A759354165</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A759354165</galeid><sourcerecordid>A759354165</sourcerecordid><originalsourceid>FETCH-LOGICAL-c404t-31b7beeeb867b0df0556a8bfdbc6a306a0c86d00a2edad02aea014244f7beba53</originalsourceid><addsrcrecordid>eNp1UU1P3DAQjVCRSoF7j5a49NDAOHGc7DGs-Fi-D_RsjZNx8CobL3aCgF9PlqWgonKZGWveex69F0U_OewL4MXB7KyczvYT4BJy4JBuRFt8IngsE5F9e5_T_Hv0I4Q5gMwgy7eiecmu3AO17PRJe1vbZ-yt65gz7OhxSVX_-owv8dEu_u6wq9l5fEnYBVa2jfO2v1sEZpxnh9T35Nm0HcLYbdewG_LjYoFdRTvRpsE20O5b347-HB_dTk_ji-uT2bS8iCsBoo9TrnNNRLqQuYbaQJZJLLSpdSUxBYlQFbIGwIRqrCFBQuAiEcKMNI1Zuh3trXWX3t0PFHo1d4Pvxi9VMkl5IUXC8xH1e41qsCWlh2A7CmMJtrnrQ4NDCKrMs0maCS5XorCGV96F4MmopbcL9E-Kg1oFoF4DUB8BjJTjNcU29uOE0Wv1j9fKGbXy-iud1am__iP0GaeWtUlfAOZGnqs</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2931864217</pqid></control><display><type>article</type><title>A Novel Hybridization of Expectation-Maximization and K-Means Algorithms for Better Clustering Performance</title><source>ProQuest Central UK/Ireland</source><source>ProQuest Central</source><creator>Kishor, Duggirala Raja ; Venkateswarlu, N.B</creator><creatorcontrib>Kishor, Duggirala Raja ; Venkateswarlu, N.B</creatorcontrib><description>Expectation Maximization (EM) is a widely employed mixture model-based data clustering algorithm and produces exceptionally good results. However, many researchers reported that the EM algorithm requires huge computational efforts than other clustering algorithms. This paper presents an algorithm for the novel hybridization of EM and K-Means techniques for achieving better clustering performance (NovHbEMKM). This algorithm first performs K-Means and then using these results it performs EM and K-Means in the alternative iterations. 
Along with the NovHbEMKM, experiments are carried out with the algorithms for EM, EM using the results of K-Means and Cluster package of Purdue University. Experiments are carried out with datasets from UCI ML repository and synthetic datasets. Execution time, Clustering Fitness and Sum of Squared Errors (SSE) are computed as performance criteria. In all the experiments the proposed NovHbEMKM algorithm is taking less execution time by producing results with higher clustering fitness and lesser SSE than other algorithms including the Cluster package.</description><identifier>ISSN: 1941-6237</identifier><identifier>EISSN: 1941-6245</identifier><identifier>DOI: 10.4018/IJACI.2016070103</identifier><language>eng</language><publisher>Hershey: IGI Global</publisher><subject>Algorithms ; Clustering ; Computational linguistics ; Datasets ; Language processing ; Maximization ; Natural language interfaces ; Optimization ; Synthetic data</subject><ispartof>International journal of ambient computing and intelligence, 2016-07, Vol.7 (2), p.47-74</ispartof><rights>COPYRIGHT 2016 IGI Global</rights><rights>Copyright © 2016, IGI Global. 
Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c404t-31b7beeeb867b0df0556a8bfdbc6a306a0c86d00a2edad02aea014244f7beba53</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2931864217?pq-origsite=primo$$EHTML$$P50$$Gproquest$$H</linktohtml><link.rule.ids>314,780,784,21388,27924,27925,33744,43805,64385,64389,72469</link.rule.ids></links><search><creatorcontrib>Kishor, Duggirala Raja</creatorcontrib><creatorcontrib>Venkateswarlu, N.B</creatorcontrib><title>A Novel Hybridization of Expectation-Maximization and K-Means Algorithms for Better Clustering Performance</title><title>International journal of ambient computing and intelligence</title><description>Expectation Maximization (EM) is a widely employed mixture model-based data clustering algorithm and produces exceptionally good results. However, many researchers reported that the EM algorithm requires huge computational efforts than other clustering algorithms. This paper presents an algorithm for the novel hybridization of EM and K-Means techniques for achieving better clustering performance (NovHbEMKM). This algorithm first performs K-Means and then using these results it performs EM and K-Means in the alternative iterations. Along with the NovHbEMKM, experiments are carried out with the algorithms for EM, EM using the results of K-Means and Cluster package of Purdue University. Experiments are carried out with datasets from UCI ML repository and synthetic datasets. Execution time, Clustering Fitness and Sum of Squared Errors (SSE) are computed as performance criteria. 
In all the experiments the proposed NovHbEMKM algorithm is taking less execution time by producing results with higher clustering fitness and lesser SSE than other algorithms including the Cluster package.</description><subject>Algorithms</subject><subject>Clustering</subject><subject>Computational linguistics</subject><subject>Datasets</subject><subject>Language processing</subject><subject>Maximization</subject><subject>Natural language interfaces</subject><subject>Optimization</subject><subject>Synthetic data</subject><issn>1941-6237</issn><issn>1941-6245</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>N95</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp1UU1P3DAQjVCRSoF7j5a49NDAOHGc7DGs-Fi-D_RsjZNx8CobL3aCgF9PlqWgonKZGWveex69F0U_OewL4MXB7KyczvYT4BJy4JBuRFt8IngsE5F9e5_T_Hv0I4Q5gMwgy7eiecmu3AO17PRJe1vbZ-yt65gz7OhxSVX_-owv8dEu_u6wq9l5fEnYBVa2jfO2v1sEZpxnh9T35Nm0HcLYbdewG_LjYoFdRTvRpsE20O5b347-HB_dTk_ji-uT2bS8iCsBoo9TrnNNRLqQuYbaQJZJLLSpdSUxBYlQFbIGwIRqrCFBQuAiEcKMNI1Zuh3trXWX3t0PFHo1d4Pvxi9VMkl5IUXC8xH1e41qsCWlh2A7CmMJtrnrQ4NDCKrMs0maCS5XorCGV96F4MmopbcL9E-Kg1oFoF4DUB8BjJTjNcU29uOE0Wv1j9fKGbXy-iud1am__iP0GaeWtUlfAOZGnqs</recordid><startdate>20160701</startdate><enddate>20160701</enddate><creator>Kishor, Duggirala Raja</creator><creator>Venkateswarlu, N.B</creator><general>IGI 
Global</general><scope>AAYXX</scope><scope>CITATION</scope><scope>N95</scope><scope>7SC</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>L6V</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M7S</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20160701</creationdate><title>A Novel Hybridization of Expectation-Maximization and K-Means Algorithms for Better Clustering Performance</title><author>Kishor, Duggirala Raja ; Venkateswarlu, N.B</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c404t-31b7beeeb867b0df0556a8bfdbc6a306a0c86d00a2edad02aea014244f7beba53</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Algorithms</topic><topic>Clustering</topic><topic>Computational linguistics</topic><topic>Datasets</topic><topic>Language processing</topic><topic>Maximization</topic><topic>Natural language interfaces</topic><topic>Optimization</topic><topic>Synthetic data</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kishor, Duggirala Raja</creatorcontrib><creatorcontrib>Venkateswarlu, N.B</creatorcontrib><collection>CrossRef</collection><collection>Gale Business: Insights</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced 
Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Engineering Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Engineering Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><jtitle>International journal of ambient computing and intelligence</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kishor, Duggirala Raja</au><au>Venkateswarlu, N.B</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Novel Hybridization of Expectation-Maximization and K-Means Algorithms for Better Clustering Performance</atitle><jtitle>International journal of ambient computing and intelligence</jtitle><date>2016-07-01</date><risdate>2016</risdate><volume>7</volume><issue>2</issue><spage>47</spage><epage>74</epage><pages>47-74</pages><issn>1941-6237</issn><eissn>1941-6245</eissn><abstract>Expectation Maximization (EM) is a widely employed mixture model-based data clustering algorithm and 
produces exceptionally good results. However, many researchers reported that the EM algorithm requires huge computational efforts than other clustering algorithms. This paper presents an algorithm for the novel hybridization of EM and K-Means techniques for achieving better clustering performance (NovHbEMKM). This algorithm first performs K-Means and then using these results it performs EM and K-Means in the alternative iterations. Along with the NovHbEMKM, experiments are carried out with the algorithms for EM, EM using the results of K-Means and Cluster package of Purdue University. Experiments are carried out with datasets from UCI ML repository and synthetic datasets. Execution time, Clustering Fitness and Sum of Squared Errors (SSE) are computed as performance criteria. In all the experiments the proposed NovHbEMKM algorithm is taking less execution time by producing results with higher clustering fitness and lesser SSE than other algorithms including the Cluster package.</abstract><cop>Hershey</cop><pub>IGI Global</pub><doi>10.4018/IJACI.2016070103</doi><tpages>28</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1941-6237
ispartof International journal of ambient computing and intelligence, 2016-07, Vol.7 (2), p.47-74
issn 1941-6237
1941-6245
language eng
recordid cdi_gale_businessinsightsgauss_A759354165
source ProQuest Central UK/Ireland; ProQuest Central
subjects Algorithms
Clustering
Computational linguistics
Datasets
Language processing
Maximization
Natural language interfaces
Optimization
Synthetic data
title A Novel Hybridization of Expectation-Maximization and K-Means Algorithms for Better Clustering Performance
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T07%3A47%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Novel%20Hybridization%20of%20Expectation-Maximization%20and%20K-Means%20Algorithms%20for%20Better%20Clustering%20Performance&rft.jtitle=International%20journal%20of%20ambient%20computing%20and%20intelligence&rft.au=Kishor,%20Duggirala%20Raja&rft.date=2016-07-01&rft.volume=7&rft.issue=2&rft.spage=47&rft.epage=74&rft.pages=47-74&rft.issn=1941-6237&rft.eissn=1941-6245&rft_id=info:doi/10.4018/IJACI.2016070103&rft_dat=%3Cgale_cross%3EA759354165%3C/gale_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2931864217&rft_id=info:pmid/&rft_galeid=A759354165&rfr_iscdi=true
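
The description field of this record outlines the hybrid scheme only briefly: run K-Means first, then alternate EM and K-Means iterations, and compare results by criteria such as SSE. The following Python sketch shows one possible reading of that outline. It is not the authors' NovHbEMKM implementation: the function names, the spherical shared-variance Gaussian mixture, the iteration counts, and the choice to begin the alternation with an EM step are all assumptions made for illustration. The record does not define Clustering Fitness, so it is left out.

```python
import math
import random

def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def assign(points, centers):
    """Hard assignment: index of the nearest centre for each point."""
    return [min(range(len(centers)), key=lambda j: sq_dist(p, centers[j]))
            for p in points]

def sse(points, centers, labels):
    """Sum of squared distances from each point to its assigned centre."""
    return sum(sq_dist(p, centers[l]) for p, l in zip(points, labels))

def kmeans_step(points, centers):
    """One K-Means iteration: assign points, then recompute centres as means."""
    labels = assign(points, centers)
    new_centers = []
    for j, old in enumerate(centers):
        members = [p for p, l in zip(points, labels) if l == j]
        if members:
            new_centers.append([sum(c) / len(members) for c in zip(*members)])
        else:
            new_centers.append(list(old))  # empty cluster: keep the old centre
    return new_centers

def em_step(points, centers, var, weights):
    """One EM iteration for a spherical Gaussian mixture with a shared variance
    (an assumed model; the record does not specify the covariance structure)."""
    n, d = len(points), len(points[0])
    # E-step: responsibilities, computed in log space for numerical stability.
    resp = []
    for p in points:
        logs = [math.log(w) - sq_dist(p, c) / (2 * var)
                for w, c in zip(weights, centers)]
        top = max(logs)
        exps = [math.exp(v - top) for v in logs]
        total = sum(exps)
        resp.append([e / total for e in exps])
    # M-step: update centres, mixing weights and the shared variance.
    new_centers, new_weights, scatter = [], [], 0.0
    for j in range(len(centers)):
        nk = max(sum(r[j] for r in resp), 1e-12)
        centre = [sum(r[j] * p[i] for r, p in zip(resp, points)) / nk
                  for i in range(d)]
        scatter += sum(r[j] * sq_dist(p, centre) for r, p in zip(resp, points))
        new_centers.append(centre)
        new_weights.append(max(nk / n, 1e-12))
    return new_centers, max(scatter / (n * d), 1e-9), new_weights

def hybrid_em_kmeans(points, k, kmeans_init_iters=5, alt_iters=10, seed=0):
    """K-Means first, then EM and K-Means in alternate iterations."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    for _ in range(kmeans_init_iters):
        centers = kmeans_step(points, centers)
    # Initialise the mixture parameters from the K-Means result.
    labels = assign(points, centers)
    n, d = len(points), len(points[0])
    counts = [max(labels.count(j), 1) for j in range(k)]
    weights = [c / sum(counts) for c in counts]
    var = max(sse(points, centers, labels) / (n * d), 1e-9)
    for it in range(alt_iters):
        if it % 2 == 0:   # even iterations: EM
            centers, var, weights = em_step(points, centers, var, weights)
        else:             # odd iterations: K-Means
            centers = kmeans_step(points, centers)
    labels = assign(points, centers)
    return centers, labels, sse(points, centers, labels)
```

Calling `hybrid_em_kmeans(points, k)` on a list of equal-length numeric tuples returns the final centres, the hard labels, and the SSE. Timing the call with `time.perf_counter()` would give an execution-time figure comparable in spirit to the criterion named in the description.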