PLDLS: A novel parallel label diffusion and label Selection-based community detection algorithm based on Spark in social networks

•A novel fast and accurate Spark-based parallel community detection algorithm is proposed.•The proposed PLDLS algorithm uses label diffusion of core nodes along with a new label selection method.•Multi-factor criteria for computing nodes importance is used to select core nodes.•A fast and parallel m...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems with applications 2021-11, Vol.183, p.115377, Article 115377
Hauptverfasser: Roghani, Hamid, Bouyer, Asgarali, Nourani, Esmaeil
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page 115377
container_title Expert systems with applications
container_volume 183
creator Roghani, Hamid
Bouyer, Asgarali
Nourani, Esmaeil
description •A novel fast and accurate Spark-based parallel community detection algorithm is proposed.•The proposed PLDLS algorithm uses label diffusion of core nodes along with a new label selection method.•Multi-factor criteria for computing nodes importance is used to select core nodes.•A fast and parallel merge phase is utilized to obtain more dense and accurate communities.•The result of PLDLS completely is robust, stable and scalable in comparison with other examined methods. Parallel and distributed community detection in large-scale complex networks, such as social networks, is a challenging task. Parallel and distributed algorithm with high accuracy and low computational complexity is one of the essential issues in the community detection field. In this paper, we propose a novel fast, and accurate Spark-based parallel label diffusion and label selection-based (PLDLS) community detection algorithm with two-step of label diffusion of core nodes along with a new label selection (propagation) method. We have used multi-factor criteria for computing node's importance and adopted a new method for selecting core nodes. In the first phase, utilizing the fact that nodes forming triangles, tend to be in the same community, parallel label diffusion of core nodes is performed to diffuse labels up to two levels. In the second phase, through an iterative and parallel process, the most appropriate labels are assigned to the remaining nodes. PLDLS proposes an improved robust version of LPA by putting aside randomness parameter tuning. Furthermore, we utilize a fast and parallel merge phase to get even more dense and accurate communities. Conducted experiments on real-world and artificial networks, indicates the better accuracy and low execution time of PLDLS in comparison with other examined methods.
doi_str_mv 10.1016/j.eswa.2021.115377
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2579413735</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0957417421008034</els_id><sourcerecordid>2579413735</sourcerecordid><originalsourceid>FETCH-LOGICAL-c328t-a001f9b63346c553f5748a462b96c1251c738002ed7ed5cf97af37c46c403ec43</originalsourceid><addsrcrecordid>eNp9kM1OwzAQhC0EEqXwApwscU6w4zhuEJeq_EqVQCqcLdfZgFsnLnbSqkfeHFfpmct6NZ7ZXX0IXVOSUkKL21UKYafSjGQ0pZQzIU7QiE4ESwpRslM0IiUXSU5Ffo4uQlgRQgUhYoR-3-cP88UdnuLWbcHijfLK2thYtYy1MnXdB-NarNrqqC3Agu6ilixVgApr1zR9a7o9rqAbfrCyX86b7rvBgydKizh6jU2Lg9NGWdxCt3N-HS7RWa1sgKvjO0afT48fs5dk_vb8OpvOE82ySZeoeHJdLgvG8kJzzmou8onKi2xZFppmnGrBJoRkUAmouK5LoWomdDTnhIHO2RjdDHM33v30EDq5cr1v40qZcVHmlAnGoysbXNq7EDzUcuNNo_xeUiIPqOVKHlDLA2o5oI6h-yEE8f6tAS-DNtBqqIyPQGTlzH_xP-MeiBY</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2579413735</pqid></control><display><type>article</type><title>PLDLS: A novel parallel label diffusion and label Selection-based community detection algorithm based on Spark in social networks</title><source>Elsevier ScienceDirect Journals</source><creator>Roghani, Hamid ; Bouyer, Asgarali ; Nourani, Esmaeil</creator><creatorcontrib>Roghani, Hamid ; Bouyer, Asgarali ; Nourani, Esmaeil</creatorcontrib><description>•A novel fast and accurate Spark-based parallel community detection algorithm is proposed.•The proposed PLDLS algorithm uses label diffusion of core nodes along with a new label selection method.•Multi-factor criteria for computing nodes importance is used to select core nodes.•A fast and parallel merge phase is utilized to obtain more dense and accurate communities.•The result of PLDLS completely is robust, stable and scalable in comparison with other examined methods. Parallel and distributed community detection in large-scale complex networks, such as social networks, is a challenging task. Parallel and distributed algorithm with high accuracy and low computational complexity is one of the essential issues in the community detection field. In this paper, we propose a novel fast, and accurate Spark-based parallel label diffusion and label selection-based (PLDLS) community detection algorithm with two-step of label diffusion of core nodes along with a new label selection (propagation) method. We have used multi-factor criteria for computing node's importance and adopted a new method for selecting core nodes. In the first phase, utilizing the fact that nodes forming triangles, tend to be in the same community, parallel label diffusion of core nodes is performed to diffuse labels up to two levels. In the second phase, through an iterative and parallel process, the most appropriate labels are assigned to the remaining nodes. PLDLS proposes an improved robust version of LPA by putting aside randomness parameter tuning. Furthermore, we utilize a fast and parallel merge phase to get even more dense and accurate communities. Conducted experiments on real-world and artificial networks, indicates the better accuracy and low execution time of PLDLS in comparison with other examined methods.</description><identifier>ISSN: 0957-4174</identifier><identifier>EISSN: 1873-6793</identifier><identifier>DOI: 10.1016/j.eswa.2021.115377</identifier><language>eng</language><publisher>New York: Elsevier Ltd</publisher><subject>Algorithms ; Complexity ; Diffusion rate ; Iterative methods ; Label diffusion ; Label selection ; Labels ; Local similarity ; Nodes ; Parallel community detection ; Social networks ; Spark</subject><ispartof>Expert systems with applications, 2021-11, Vol.183, p.115377, Article 115377</ispartof><rights>2021 Elsevier Ltd</rights><rights>Copyright Elsevier BV Nov 30, 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c328t-a001f9b63346c553f5748a462b96c1251c738002ed7ed5cf97af37c46c403ec43</citedby><cites>FETCH-LOGICAL-c328t-a001f9b63346c553f5748a462b96c1251c738002ed7ed5cf97af37c46c403ec43</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0957417421008034$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65534</link.rule.ids></links><search><creatorcontrib>Roghani, Hamid</creatorcontrib><creatorcontrib>Bouyer, Asgarali</creatorcontrib><creatorcontrib>Nourani, Esmaeil</creatorcontrib><title>PLDLS: A novel parallel label diffusion and label Selection-based community detection algorithm based on Spark in social networks</title><title>Expert systems with applications</title><description>•A novel fast and accurate Spark-based parallel community detection algorithm is proposed.•The proposed PLDLS algorithm uses label diffusion of core nodes along with a new label selection method.•Multi-factor criteria for computing nodes importance is used to select core nodes.•A fast and parallel merge phase is utilized to obtain more dense and accurate communities.•The result of PLDLS completely is robust, stable and scalable in comparison with other examined methods. Parallel and distributed community detection in large-scale complex networks, such as social networks, is a challenging task. Parallel and distributed algorithm with high accuracy and low computational complexity is one of the essential issues in the community detection field. In this paper, we propose a novel fast, and accurate Spark-based parallel label diffusion and label selection-based (PLDLS) community detection algorithm with two-step of label diffusion of core nodes along with a new label selection (propagation) method. We have used multi-factor criteria for computing node's importance and adopted a new method for selecting core nodes. In the first phase, utilizing the fact that nodes forming triangles, tend to be in the same community, parallel label diffusion of core nodes is performed to diffuse labels up to two levels. In the second phase, through an iterative and parallel process, the most appropriate labels are assigned to the remaining nodes. PLDLS proposes an improved robust version of LPA by putting aside randomness parameter tuning. Furthermore, we utilize a fast and parallel merge phase to get even more dense and accurate communities. Conducted experiments on real-world and artificial networks, indicates the better accuracy and low execution time of PLDLS in comparison with other examined methods.</description><subject>Algorithms</subject><subject>Complexity</subject><subject>Diffusion rate</subject><subject>Iterative methods</subject><subject>Label diffusion</subject><subject>Label selection</subject><subject>Labels</subject><subject>Local similarity</subject><subject>Nodes</subject><subject>Parallel community detection</subject><subject>Social networks</subject><subject>Spark</subject><issn>0957-4174</issn><issn>1873-6793</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNp9kM1OwzAQhC0EEqXwApwscU6w4zhuEJeq_EqVQCqcLdfZgFsnLnbSqkfeHFfpmct6NZ7ZXX0IXVOSUkKL21UKYafSjGQ0pZQzIU7QiE4ESwpRslM0IiUXSU5Ffo4uQlgRQgUhYoR-3-cP88UdnuLWbcHijfLK2thYtYy1MnXdB-NarNrqqC3Agu6ilixVgApr1zR9a7o9rqAbfrCyX86b7rvBgydKizh6jU2Lg9NGWdxCt3N-HS7RWa1sgKvjO0afT48fs5dk_vb8OpvOE82ySZeoeHJdLgvG8kJzzmou8onKi2xZFppmnGrBJoRkUAmouK5LoWomdDTnhIHO2RjdDHM33v30EDq5cr1v40qZcVHmlAnGoysbXNq7EDzUcuNNo_xeUiIPqOVKHlDLA2o5oI6h-yEE8f6tAS-DNtBqqIyPQGTlzH_xP-MeiBY</recordid><startdate>20211130</startdate><enddate>20211130</enddate><creator>Roghani, Hamid</creator><creator>Bouyer, Asgarali</creator><creator>Nourani, Esmaeil</creator><general>Elsevier Ltd</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20211130</creationdate><title>PLDLS: A novel parallel label diffusion and label Selection-based community detection algorithm based on Spark in social networks</title><author>Roghani, Hamid ; Bouyer, Asgarali ; Nourani, Esmaeil</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c328t-a001f9b63346c553f5748a462b96c1251c738002ed7ed5cf97af37c46c403ec43</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Complexity</topic><topic>Diffusion rate</topic><topic>Iterative methods</topic><topic>Label diffusion</topic><topic>Label selection</topic><topic>Labels</topic><topic>Local similarity</topic><topic>Nodes</topic><topic>Parallel community detection</topic><topic>Social networks</topic><topic>Spark</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Roghani, Hamid</creatorcontrib><creatorcontrib>Bouyer, Asgarali</creatorcontrib><creatorcontrib>Nourani, Esmaeil</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Expert systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Roghani, Hamid</au><au>Bouyer, Asgarali</au><au>Nourani, Esmaeil</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>PLDLS: A novel parallel label diffusion and label Selection-based community detection algorithm based on Spark in social networks</atitle><jtitle>Expert systems with applications</jtitle><date>2021-11-30</date><risdate>2021</risdate><volume>183</volume><spage>115377</spage><pages>115377-</pages><artnum>115377</artnum><issn>0957-4174</issn><eissn>1873-6793</eissn><abstract>•A novel fast and accurate Spark-based parallel community detection algorithm is proposed.•The proposed PLDLS algorithm uses label diffusion of core nodes along with a new label selection method.•Multi-factor criteria for computing nodes importance is used to select core nodes.•A fast and parallel merge phase is utilized to obtain more dense and accurate communities.•The result of PLDLS completely is robust, stable and scalable in comparison with other examined methods. Parallel and distributed community detection in large-scale complex networks, such as social networks, is a challenging task. Parallel and distributed algorithm with high accuracy and low computational complexity is one of the essential issues in the community detection field. In this paper, we propose a novel fast, and accurate Spark-based parallel label diffusion and label selection-based (PLDLS) community detection algorithm with two-step of label diffusion of core nodes along with a new label selection (propagation) method. We have used multi-factor criteria for computing node's importance and adopted a new method for selecting core nodes. In the first phase, utilizing the fact that nodes forming triangles, tend to be in the same community, parallel label diffusion of core nodes is performed to diffuse labels up to two levels. In the second phase, through an iterative and parallel process, the most appropriate labels are assigned to the remaining nodes. PLDLS proposes an improved robust version of LPA by putting aside randomness parameter tuning. Furthermore, we utilize a fast and parallel merge phase to get even more dense and accurate communities. Conducted experiments on real-world and artificial networks, indicates the better accuracy and low execution time of PLDLS in comparison with other examined methods.</abstract><cop>New York</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.eswa.2021.115377</doi></addata></record>
fulltext fulltext
identifier ISSN: 0957-4174
ispartof Expert systems with applications, 2021-11, Vol.183, p.115377, Article 115377
issn 0957-4174
1873-6793
language eng
recordid cdi_proquest_journals_2579413735
source Elsevier ScienceDirect Journals
subjects Algorithms
Complexity
Diffusion rate
Iterative methods
Label diffusion
Label selection
Labels
Local similarity
Nodes
Parallel community detection
Social networks
Spark
title PLDLS: A novel parallel label diffusion and label Selection-based community detection algorithm based on Spark in social networks
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-21T00%3A43%3A22IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=PLDLS:%20A%20novel%20parallel%20label%20diffusion%20and%20label%20Selection-based%20community%20detection%20algorithm%20based%20on%20Spark%20in%20social%20networks&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Roghani,%20Hamid&rft.date=2021-11-30&rft.volume=183&rft.spage=115377&rft.pages=115377-&rft.artnum=115377&rft.issn=0957-4174&rft.eissn=1873-6793&rft_id=info:doi/10.1016/j.eswa.2021.115377&rft_dat=%3Cproquest_cross%3E2579413735%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2579413735&rft_id=info:pmid/&rft_els_id=S0957417421008034&rfr_iscdi=true